La Très Grande Bibliothèque

In collaboration with the Observatoire des textes, des idées et des corpus (ObTIC) team at the Sorbonne, we are pleased to announce the first-ever PhiloLogic4 build of the Très Grande Bibliothèque (TGB) corpus. The TGB is a collection of documents from the Gallica digital library at the Bibliothèque nationale de France.

The corpus we are making available contains 112,907 texts published primarily in the 19th century and is a direct result of our previous project, Practices and Legacies of 18th-century Culture with the Labex OBVIL. The texts are the result of OCR processing, therefore the quality of the digitized text depends in large part on the state of the source.


Access the TGB corpus

You can access the full corpus (130,000 texts) from the BnF at this link: https://api.bnf.fr/documents-de-gallica-produits-au-format-tei-par-obvil