In a Training Loop 🔄

Mou Chen

Mou11209203

emmanuelwithme

AI & ML interests

None yet

Recent Activity

reacted to julien-c's post with 🔥 25 days ago

BOOOOM: Today I'm dropping TINY AGENTS the 50 lines of code Agent in Javascript 🔥 I spent the last few weeks working on this, so I hope you will like it. I've been diving into MCP (Model Context Protocol) to understand what the hype was all about. It is fairly simple, but still quite powerful: MCP is a standard API to expose sets of Tools that can be hooked to LLMs. But while doing that, came my second realization: Once you have a MCP Client, an Agent is literally just a while loop on top of it. 🤯 ➡️ read it exclusively on the official HF blog: https://huggingface.co/blog/tiny-agents

upvoted an article 25 days ago

mmBERT: ModernBERT goes Multilingual

upvoted an article about 1 month ago

BioClinical ModernBERT: an example of continued pre-training of ModernBERT

Organizations

upvoted an article 25 days ago

Article

mmBERT: ModernBERT goes Multilingual

Sep 9, 2025 • 133

upvoted an article about 1 month ago

Article

BioClinical ModernBERT: an example of continued pre-training of ModernBERT

Sep 10, 2025

upvoted a collection about 1 month ago

Common Pile v0.1

Collection

All resources related to Common Pile v0.1, an 8TB dataset of public domain and openly licensed text • 4 items • Updated Jun 6, 2025 • 39

upvoted a paper 2 months ago

Smarter, Better, Faster, Longer: A Modern Bidirectional Encoder for Fast, Memory Efficient, and Long Context Finetuning and Inference

Paper • 2412.13663 • Published Dec 18, 2024 • 158

upvoted 2 papers 3 months ago

LlamaFactory: Unified Efficient Fine-Tuning of 100+ Language Models

Paper • 2403.13372 • Published Mar 20, 2024 • 175

Less is More: Recursive Reasoning with Tiny Networks

Paper • 2510.04871 • Published Oct 6, 2025 • 501

upvoted 2 collections 8 months ago

Flan-T5 release

Collection

The Flan-T5 release covers 4 checkpoints of different sizes each time. It also includes upgraded versions trained using Universal sampling • 7 items • Updated Jul 10, 2025 • 30

GTE models

Collection

General Text Embedding Models Released by Tongyi Lab of Alibaba Group • 21 items • Updated Jan 21, 2025 • 33

upvoted an article 8 months ago

Article

Fine-tune ModernBERT for text classification using synthetic data

Dec 30, 2024

upvoted an article 9 months ago

Article

Finally, a Replacement for BERT: Introducing ModernBERT

Dec 19, 2024 • 717

upvoted a paper 9 months ago

Leave No Context Behind: Efficient Infinite Context Transformers with Infini-attention

Paper • 2404.07143 • Published Apr 10, 2024 • 111

upvoted an article 10 months ago

Article

MTEB: Massive Text Embedding Benchmark

Oct 19, 2022
