Mining dutch history researching public debate in the nineteenth century
Download
1 / 22

Mining Dutch History: researching public debate in the nineteenth century - PowerPoint PPT Presentation


  • 55 Views
  • Uploaded on

Mining Dutch History: researching public debate in the nineteenth century. Dr José de Kruif Researcher Research Institute for History and Culture Utrecht University. Newspaper (1840). 2. Pamphlet production. 3. Pamphlet april 1853. 4. Text fragments considered typical.

loader
I am the owner, or an agent authorized to act on behalf of the owner, of the copyrighted work described.
capcha
Download Presentation

PowerPoint Slideshow about ' Mining Dutch History: researching public debate in the nineteenth century' - patricia-whitley


An Image/Link below is provided (as is) to download presentation

Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author.While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server.


- - - - - - - - - - - - - - - - - - - - - - - - - - E N D - - - - - - - - - - - - - - - - - - - - - - - - - -
Presentation Transcript
Mining dutch history researching public debate in the nineteenth century

Mining Dutch History: researching public debate in the nineteenth century

Dr José de Kruif

Researcher

Research Institute for History and Culture

Utrecht University


Newspaper 1840
Newspaper (1840) nineteenth century

2


Pamphlet production
Pamphlet production nineteenth century

3


Pamphlet april 1853
Pamphlet april 1853 nineteenth century

4


Text fragments considered typical
Text fragments considered typical nineteenth century

We gaan naar den grond met die verdraagzaamheid, en verliezen onze eigene vrijheid terwijl wij zoo dolzinnig ijveren voor die van anderen. We zullen er de vruchten van plukken, als de inquisitie regt spreekt op onzen vrijen grond en de schavotten staan opgerigt voor ons en onze kinderen.“

“Tolerance will be our Waterloo. We will loose our freedom whilst devoting ourselves to the freedom of others. We will only recognize the fruits of our ignorance when the inquisition judges on our free soil and the scaffolds will be the fate of ourselves and our children.”

Bij gevolg kan elk middel, hoe snood , hoe onredelijk, hoe

goddeloos ook, aangewend worden: staatkundige verdeeldheid

revolutie, burgertwist, inquisitie, brandstapels, vergif, zede-

loosheid , koningsmoord,... Ziedaar wapenen in handen der

Jezuïten !

“Every means, however nasty, malicious or blasphemous can be used: inciting civil war, revolution, inquisition, burning at the stake, poison, murdering the king …are all weapons in the hands of the Jesuits.”

5


Digitizing database
Digitizing, database nineteenth century

Textmining

Documents

Scan

OCR Text

Results

Database

Meta data

6


Access database
Access Database nineteenth century

7


Extracted results
Extracted results nineteenth century

8


Synonyms jesuits
Synonyms Jesuits nineteenth century

9


Refining extraction results
Refining extraction results nineteenth century

10


Actors 1853
Actors 1853 nineteenth century

11


Text link analysis definitions
Text Link analysis definitions nineteenth century

12


Opinions on the pope
Opinions on the pope nineteenth century

13



Categories arguments
Categories arguments well…......

16


Textmining node and anomaly
Textmining node and anomaly well…......

17


Peer groups outliers
Peer groups & outliers well…......

Group 1: History & civil disorder

Group 2: History & new constitution

Group 3: No history. Civil disorder

Group 4: Very moderate & 3 outliers

18



Advantages
Advantages well…......

  • Gives insight into large number of documents. No need to use just a few and run the risk of not having a representative sample

  • Combining advantages of text analysis with statistical techniques.

  • Possibility to enrich the dictionary of the software with specific domain knowledge.

  • - New approaches possible

20


Set backs
Set-backs well…......

  • The researcher will need some knowledge of the documents and their subject to be able to interpret the results.

  • The approach is especially apt for broad research of large quantities of text. The more one zooms in, the less relevant the cluster results will become.

  • -Supplementing the lexical universe of the software with specific domain knowledge might be time-consuming.

  • - The researcher will have to be familiar, or will need to familiarize him or herself, with a number of statistic techniques (e.g. cluster analysis).

21


Mining dutch history researching public debate in the nineteenth century1

Mining Dutch History: researching public debate in the nineteenth century

Dr José de Kruif

Researcher

Research Institute for History and Culture

Utrecht University


ad