Probing Scientific General Intelligence of LLMs with Scientist-Aligned Workflows Paper • 2512.16969 • Published 10 days ago • 105
view article Article Tokenization in Transformers v5: Simpler, Clearer, and More Modular +4 11 days ago • 79
sonoisa/sentence-bert-base-ja-mean-tokens-v2 Feature Extraction • 0.1B • Updated Apr 17, 2024 • 54k • • 51
Agent Lightning: Train ANY AI Agents with Reinforcement Learning Paper • 2508.03680 • Published Aug 5 • 121
BioBERT Collection This collection hosts BioBERT (Bioinformatics 2020) series, a domain-specific adaptation of BERT pre-trained on biomedical corpora. • 9 items • Updated Oct 17, 2024 • 4
BioBERT: a pre-trained biomedical language representation model for biomedical text mining Paper • 1901.08746 • Published Jan 25, 2019 • 6
Running 306 LLM Embeddings Explained: A Visual and Intuitive Guide 🚀 306 How Language Models Turn Text into Meaning, From Traditional
Energy-Based Transformers are Scalable Learners and Thinkers Paper • 2507.02092 • Published Jul 2 • 69
Advances and Challenges in Foundation Agents: From Brain-Inspired Intelligence to Evolutionary, Collaborative, and Safe Systems Paper • 2504.01990 • Published Mar 31 • 301
InfiniteYou: Flexible Photo Recrafting While Preserving Your Identity Paper • 2503.16418 • Published Mar 20 • 36