• ATILF - CNRS
  • UNIVERSITY OF CHICAGO
  • DIVISION OF THE HUMANITIES
  • UNIVERSITY OF CHICAGO LIBRARY

The ARTFL Project

  • What's new
  • Papers & Presentations
  • PhiloLogic
  • Subscription Info
  • About ARTFL
  • Contact us
Home

Table of Contents

  • ARTFL Resources
  • About ARTFL
  • Subscription Information

Archives parlementaires (May 2013)

§ Search the Archives: (e.g., tradition)
Word Similarity Search

§ Select a Search Option:
Single Term and Phrase Search (default) Phrase separated by words
Proximity Searching in the same Sentence or in the same Paragraph

§ Select a Results Format: Occurrences with Context (default) Occurrences Line by Line (KWIC)


§ Limit Searches by:
Header: (e.g., juillet 1790)
Div Type: (e.g., session or annexe)
Speaker: (e.g., Condorcet)
Volume: (e.g., APvol49)
Date: (e.g., 1793)
Volume Date: (e.g., 1790-09-16)
Session Date: (e.g., 1790-09-16) Experimental:
SubDiv Tag: (e.g., list or sp)

§ Refined Search Results:
Frequency by Div
Frequency by Title   Frequency by Title per 10,000
Frequency by Years   Frequency by Years per 10,000
Collocation Table Spanning words. Turn Filter Off: Filtered Words
Word in Clause Position (Theme-Rheme) Display Options:
Line by Line (KWIC) Sorted by keyword and word to its Display up to occurrences.

June 10: MVO copied database to back-up machine.

May 8, 2013:

Note:  This dataset is UNCORRECTED OCR (Optical Character Recognition) output taken from the Stanford Github repository. Please use this as an alpha or "proof-of-concept" database at this time.  This build contains all of the expected 82 volumes of text.

Current state information
Volumes 52, 49, 5, 71b, 34, 25, have suspiciously high rates of the unaccented form. (also many Latin-1 accents)
Volume 8 images need to be rotated.

I have my ingestion system to automatically to rename and modify the GitHub repository files. Modifications:
Added speaker identification to the "sp" tag in the TEI.
Added dates and volume dates to internal bibliographic metadata in the TEI
Added dates to "divs" identified as "session" in "div" level metadata in the TEI

To do: identify "cahiers" as "div" object type.

Humanities Division Wordmark

The ARTFL Project
Department of Romance Languages and Literatures
Division of the Humanities
University of Chicago
1115 East 58th Street Chicago, IL 60637
tel: 773-702-8488 | email: artfl[at]artfl[dot]uchicago[dot]edu
Privacy Notice

  • What's new
  • Papers & Presentations
  • PhiloLogic
  • Subscription Info
  • About ARTFL
  • Contact us