Orion Weller PRO

orionweller

http://orionweller.com

AI & ML interests

None yet

Recent Activity

updated a dataset about 17 hours ago

mteb/results

new activity 17 days ago

orionweller/caselaw-access-project-tokens:🚩 Report: Copyright infringement

upvoted an article 21 days ago

mmBERT: ModernBERT goes Multilingual


Organizations

upvoted an article 21 days ago

mmBERT: ModernBERT goes Multilingual

Article • Sep 9 • 129

upvoted a paper about 2 months ago

Controlled Generation for Private Synthetic Text

Paper • 2509.25729 • Published Sep 30 • 10

upvoted 2 papers 2 months ago

IA2: Alignment with ICL Activations Improves Supervised Fine-Tuning

Paper • 2509.22621 • Published Sep 26 • 8

The Flaw of Averages: Quantifying Uniformity of Performance on Benchmarks

Paper • 2509.25671 • Published Sep 30 • 6

upvoted an article 2 months ago

Introducing RTEB: A New Standard for Retrieval Evaluation

Article • Oct 1 • 128

upvoted an article 3 months ago

RexBERT: Encoders for a brave new world of E-Commerce

Article • Sep 20

upvoted a paper 3 months ago

mmBERT: A Modern Multilingual Encoder with Annealed Language Learning

Paper • 2509.06888 • Published Sep 8 • 12

upvoted a collection 3 months ago

mmBERT: a modern multilingual encoder

Collection

mmBERT is trained on 3T tokens from over 1800 languages, showing SoTA scores on benchmarks and exceptional low-resource performance • 16 items • Updated Sep 9 • 49

upvoted a paper 3 months ago

On the Theoretical Limitations of Embedding-Based Retrieval

Paper • 2508.21038 • Published Aug 28 • 20

upvoted a collection 5 months ago

Encoders vs Decoders: the Ettin Suite

Collection

A collection of SOTA, open-data, paired encoder-only and decoder-only models ranging from 17M to 1B params. See the paper at https://arxiv.org/abs/250 • 32 items • Updated Jul 16 • 25

upvoted a paper 5 months ago

Seq vs Seq: An Open Suite of Paired Encoders and Decoders

Paper • 2507.11412 • Published Jul 15 • 29

upvoted an article 5 months ago

Ettin Suite: SoTA Paired Encoders and Decoders

Article • Jul 16

upvoted a paper 5 months ago

The Translation Barrier Hypothesis: Multilingual Generation with Large Language Models Suffers from Implicit Translation Failure

Paper • 2506.22724 • Published Jun 28 • 10

upvoted a paper 7 months ago

Certified Mitigation of Worst-Case LLM Copyright Infringement

Paper • 2504.16046 • Published Apr 22 • 13

upvoted a paper 8 months ago

WikiVideo: Article Generation from Multiple Videos

Paper • 2504.00939 • Published Apr 1 • 37

upvoted 2 papers 9 months ago

Beyond RAG: Task-Aware KV Cache Compression for Comprehensive Knowledge Reasoning

Paper • 2503.04973 • Published Mar 6 • 26

Rank1: Test-Time Compute for Reranking in Information Retrieval

Paper • 2502.18418 • Published Feb 25 • 28

upvoted 2 papers 10 months ago

MMTEB: Massive Multilingual Text Embedding Benchmark

Paper • 2502.13595 • Published Feb 19 • 42

Is That Your Final Answer? Test-Time Scaling Improves Selective Question Answering

Paper • 2502.13962 • Published Feb 19 • 28

upvoted a paper 11 months ago

The GAN is dead; long live the GAN! A Modern GAN Baseline

Paper • 2501.05441 • Published Jan 9 • 95
