130 likes | 217 Views
This workshop explores leveraging the Semantic Web and Canonical Text Services to enhance humanities citations and references. Topics include digitizing Latin texts, citation standards, and improving access and internationalization. The session discusses mapping CTS URN to URL, integrating ancient language entities, and utilizing CIDOC-CRM and Dbpedia for text descriptions.
E N D
Bruce Robertson, Classics, Mount Allison University Atlantic Workshop on Semantics and Services UNB June 14-15 Unwinding Humanities Citations
Dynamic Variorum Editions • Digging into Data Challenge project to digitize and align tens of thousands of Latin texts • How do we use these? • Citation • Quotation
Rethinking Humanities Citations • John 3:16, etc. form a sort of early, non-actionable semantic web • Pros: • World standard of pointers • Cons: • Abbreviated, for economical reasons: P Oxy, FgrH • Which Greek author is Ar.? • If you're English-speaking, Aristophanes • If you're French-speaking, it might be Aristotle • Difficult to access the referred-to text
Replacing the Citation • Within the web of information, use Canonical Text Services URNs: urn:cts:greekLit:tlg0532:tlg001:1:10 ^collection ^auth ^work ^subsections • In browser: • reject the abbreviation wholly • Exploit dbpedia, etc. to provide internationalized (or partially internationalized) labels for the references • Make all references able to be de-referenced
Desiderata • Standard means of mapping CTS URN to URL • Taking a very naive approach • Entities for ancient languages, peoples, literary works and their extensively internationalized rdfs:labels • Dbpedia • Means of integrating these two • Swiftowlim store with efficient owl:sameAs
#describe the text in CIDOC-CRM and dbpedia • <http://heml.mta.ca/text/urn/tlg0003/tlg001> a crm:E33. • <http://heml.mta.ca/text/urn/tlg0003/tlg001> crm:P72F dbpedia:Ancient_greek. • #Note that it has a translation • <http://heml.mta.ca/text/urn/tlg003/tlg001> crm:P73 <http://heml.mta.ca/text/urn/tlg0003/tlg001en>. • <http://heml.mta.ca/text/urn/tlg0003/tlg001en> crm:P72F dbpedia:English_language. • #Identify the author in dbpedia terms • <http://heml.mta.ca/text/urn/tlg0003> owl:sameAs dbpedia:Thucydides. • <http://heml.mta.ca/text/urn/tlg0003> a crm:E39. • #Identify the text in dbpedia terms • <http://heml.mta.ca/text/urn/tlg0003/tlg001> owl:sameAs dbpedia:Peloponnesian_War. • #State the relationship between author and text • <http://heml.mta.ca/text/creation_of_tlg0003_tlg_0001> a crm:E65.
HTML5 'data' Attribute • <div data-ctsurn="urn:cts:greekLit:tlg0003:tlg001:2:10"/>
Future Work • Provide links to chunked, rendered text • Perseus server • CTS server • Scan works in, e.g., JSTOR, for standard citations, and replace with this system • Apply these techniques to event labels • Associate events with canonical citations for further research in secondary works • Expose more finely grained information: syntax, semantics, grammar of Latin sentences