Flax Community

non-profit

https://github.com/huggingface/transformers/tree/master/examples/research_projects/jax-projects

AI & ML interests

JAX, Flax, TPU, 🤗

Recent Activity

stefan-it authored a paper about 1 month ago

SindBERT, the Sailor: Charting the Seas of Turkish NLP

stefan-it authored a paper about 2 months ago

The German Commons - 154 Billion Tokens of Openly Licensed Text for German Language Models

vumichien authored a paper about 2 months ago

MixtureVitae: Open Web-Scale Pretraining Dataset With High Quality Instruction and Reasoning Data Built from Permissive-First Text Sources

christopher authored a paper 6 days ago

Economies of Open Intelligence: Tracing Power & Participation in the Model Ecosystem

Paper • 2512.03073 • Published 13 days ago • 4

4rtemi5 authored 2 papers 14 days ago

On Space Folds of ReLU Neural Networks

Paper • 2502.09954 • Published Feb 14

The Space Between: On Folding, Symmetries and Sampling

Paper • 2503.08502 • Published Mar 11

christopher authored a paper about 2 months ago

The German Commons - 154 Billion Tokens of Openly Licensed Text for German Language Models

Paper • 2510.13996 • Published Oct 15 • 8

thomwolf authored a paper about 2 months ago

Robot Learning: A Tutorial

Paper • 2510.12403 • Published Oct 14 • 115

vumichien authored 2 papers about 2 months ago

MixtureVitae: Open Web-Scale Pretraining Dataset With High Quality Instruction and Reasoning Data Built from Permissive-First Text Sources

Paper • 2509.25531 • Published Sep 29 • 7

BigCodeArena: Unveiling More Reliable Human Preferences in Code Generation via Execution

Paper • 2510.08697 • Published Oct 9 • 36

christopher posted an update 2 months ago

Something very cool is cooking at

1 reply

versae authored a paper 2 months ago

BOE-XSUM: Extreme Summarization in Clear Language of Spanish Legal Decrees and Notifications

Paper • 2509.24908 • Published Sep 29 • 2

mariagrandury authored 4 papers 3 months ago

Adding LLMs to the psycholinguistic norming toolbox: A practical guide to getting the most out of human ratings

Paper • 2509.14405 • Published Sep 17 • 2

Psycholinguistic Word Features: a New Approach for the Evaluation of LLMs Alignment with Humans

Paper • 2506.22439 • Published May 29 • 3

Apertus: Democratizing Open and Compliant LLMs for Global Language Environments

Paper • 2509.14233 • Published Sep 17 • 13

La Leaderboard: A Large Language Model Leaderboard for Spanish Varieties and Languages of Spain and Latin America

Paper • 2507.00999 • Published Jul 1 • 1

lysandre posted an update 3 months ago

We're kick-starting the process of Transformers v5, with @ArthurZ and @cyrilvallez !

v5 should be significant: we're using it as a milestone for performance optimizations, saner defaults, and a much cleaner code base worthy of 2025.

Fun fact: v4.0.0-rc-1 came out on Nov 19, 2020, nearly five years ago!

6 replies

w11wo authored a paper 4 months ago

Multi-Stage Verification-Centric Framework for Mitigating Hallucination in Multi-Modal RAG

Paper • 2507.20136 • Published Jul 27

nipunsadvilkar

in flax-community/roberta-base-mr 5 months ago

Adding `safetensors` variant of this model

#1 opened 10 months ago by

gabisurita authored a paper 5 months ago

Gemini 2.5: Pushing the Frontier with Advanced Reasoning, Multimodality, Long Context, and Next Generation Agentic Capabilities

Paper • 2507.06261 • Published Jul 7 • 64

Mrinal authored a paper 5 months ago

Multilingual State Space Models for Structured Question Answering in Indic Languages

Paper • 2502.01673 • Published Feb 1 • 2

fgaim authored a paper 5 months ago

A Multi-Task Benchmark for Abusive Language Detection in Low-Resource Settings

Paper • 2505.12116 • Published May 17

thomwolf authored a paper 6 months ago

FineWeb2: One Pipeline to Scale Them All -- Adapting Pre-Training Data Processing to Every Language

Paper • 2506.20920 • Published Jun 26 • 75