FlauBERT: Unsupervised Language Model Pre-training for French


Sebastian Ruder: Transfer learning is increasingly going multilingual with language-specific BERT models: - 🇩🇪 German BERT - 🇫🇷 CamemBERT, FlauBERT - 🇮🇹 AlBERTo - 🇳🇱 RobBERT

Jeremy Howard: TIL Google invented universal language model pre-training with BERT.

laurent besacier: Our FlauBERT (French BERT) models have now been integrated into the official @huggingface library with the 4 configurations below!

Hang Le: Our FlauBERT is now natively supported by @huggingface's transformers library. Many thanks to @julien_c, @LysandreJik and the Hugging Face team for the active technical support! Paper (new version will be available soon): Code:

Hugging Face: You can now find most of them here:

D. Khuê Lê-Huu: This reminds me of the FlauBERT paper, which gave proper credit to ULMFiT (by @jeremyphoward and @seb_ruder).

laurent besacier: The (LREC) camera-ready paper on FlauBERT is now online. Includes new results with FlauBERT_large. All models available on @huggingface transformers library. Benchmark NLP tasks (FLUE) provided on

laurent besacier: Here is FlauBERT: a French LM learnt (with #CNRS J-Zay supercomputer) on a large and heterogeneous corpus. Along with it comes FLUE (evaluation setup for French NLP). FlauBERT was successfully applied to complex tasks (NLI, WSD, Parsing). More on

Jay Alammar جهاد العمار: Somebody please make AraBERT happen!

Hang Le: Our work on FlauBERT and FLUE (language models and evaluation benchmark for French) has been released today (198th birthday of Gustave Flaubert). #Flaubert Paper: Code and models:

Dr Jochen L Leidner: French #NLP with the Transformer: From #BERT via CamemBERT to FlauBERT

Machine Learning: FlauBERT: Unsupervised Language Model Pre-training for French.

Julien Velcin: Between CamemBERT and FlauBERT, which one will win the race? Anyway thank you for working on French-oriented NLP resources, and well done for finding such interesting names! #nlp #deeplearning #bert

Dominique Mariko: Comes with the FLUE benchmark! A GLUE for French!!

Christopher: FlauBERT - Unsupervised Language Model Pre-training for French. The repo contains pre-trained large & small models, all the data used, plus code for training & inference. It also contains FLUE, a GLUE-like benchmark for French NLProc

