BIB18 – Curation and analyses of the dataset derived from the XVIIIe: Bibliographie


Some analyses of the dataset are available on this website. The analyses are based on the RDF version of the data (currently v0.3.0, January 20, 2024) and were performed by Christof Schöch. Note that all numbers can be expected to shift slightly as the cleaning of the data proceeds. Collaborative authorship and editorship, as well as publication types, will most likely be the most affected by corrections. The scripts used to create the analyses are integrated into the pages on this website for inspection (click on Code), but are also available as Jupyter Notebooks at

Source of the data: Benoît Melançon. For background information see his “Libération des données” and for the corresponding Dataverse deposit, see

For more information on the transformation process, and to obtain the dataset in various formats (BibTeX, Zotero RDF, JSON), see the GitHub repository. The original transformation was performed in May 2023.

The full bibliography can also be consulted on Zotero, see:

Some remarks on the data

  • For publication dates, only the year has been used.
  • Correct ordering of first and last names of authors or publishers is difficult, so errors may persist.
  • The data source does not distinguish between URLs and DOIs; consequently, they are all listed in the URL field. However, few DOIs are present anyway.
  • Information on the language of a publication has been added automatically, using the lingua-py module. This information can be erroneous in some cases (and will be improved by hand).
  • After transformation into BibTeX, the BibTeX-tidy tool was used:
  • The resulting files are available in the archive folder and form the basis for import into Zotero.
  • Several Zotero export formats are available in the data folder.
  • Future corrections to the data will be made on Zotero, with updates to the formats in the data folder.
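
Two of the remarks above (year-only dates, and DOIs mixed into the URL field) can be illustrated with a short Python sketch. This is not the project's actual transformation code: the function names, the regular expressions, and the year range 1500–2099 are assumptions made for illustration only.

```python
import re

# Hypothetical helpers for illustration; the project's own scripts may differ.
# Assumption: publication years fall between 1500 and 2099.
YEAR_RE = re.compile(r"\b(1[5-9]\d{2}|20\d{2})\b")
# Assumption: a DOI looks like "10.<4-9 digits>/<suffix>", optionally
# preceded by "doi.org/" or "doi:". A URL that happens to contain such a
# pattern in its path would be misclassified as a DOI.
DOI_RE = re.compile(r"(?:doi\.org/|doi:\s*)?(10\.\d{4,9}/\S+)", re.IGNORECASE)


def extract_year(date_field):
    """Return the first four-digit year found in a date string, or None."""
    match = YEAR_RE.search(date_field or "")
    return int(match.group(1)) if match else None


def split_url_field(url_field):
    """Classify the content of a URL field as either a DOI or a plain URL."""
    value = (url_field or "").strip()
    match = DOI_RE.search(value)
    if match:
        # Strip trailing punctuation that often clings to copied DOIs.
        return {"doi": match.group(1).rstrip(".,;"), "url": None}
    return {"doi": None, "url": value or None}
```

Applied to a value such as "https://doi.org/10.5281/zenodo.8166110", split_url_field would report the DOI part only, while an ordinary web address is returned unchanged as a URL.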

Nota bene

Benoît Melançon specifies: “Anyone who wishes to make use of these data may do so, under two conditions. 1. The attribution of the data collection must always be stated, for example in the form «Données colligées par Benoît Melançon» (“Data collected by Benoît Melançon”). 2. No commercial exploitation of these data is tolerated. They may not be sold in any form whatsoever. In other words, in Creative Commons terms: Attribution-NonCommercial 4.0 International (CC BY-NC 4.0).”

Citation suggestion

Christof Schöch. BIB18 – Curation and Analysis of the dataset derived from the XVIIIe: Bibliographie, v0.3.0, 2023. URL:, DOI: 10.5281/zenodo.8166110.

Notes on work in progress

This website is a work in progress. All bugs, to-dos, and ideas for improvement are listed in the issue tracker. Feel free to add issues if you notice something!