
LIS1510 Library and Archives Automation Issues XML and extensible systems

LIS1510 Library and Archives Automation Issues: XML and extensible systems. Andy Dawson, School of Library, Archive & Information Studies, UCL (University of Malta 2008). What we will be covering today: shortcomings of HTML; generalised markup languages; how XML works; XML document types.


Presentation Transcript


  1. LIS1510: Library and Archives Automation Issues. XML and extensible systems. Andy Dawson, School of Library, Archive & Information Studies, UCL (University of Malta 2008)

  2. What we will be covering today • Shortcomings of HTML • Generalised markup languages • How XML works • XML document types • Other related extensible technologies

  3. Limitations of (X)HTML • Fixed tag set (specifications determined by W3C) • Intended for display of documents on the Web • Doesn’t do everything everyone wants • Not easy to use for other purposes, such as searching in documents or analysis of documents

  4. Principles of Generalized Markup • Descriptive markup – encodes features within a document • Say what those features are - not what to do with them • Need to define your own tags • Creates machine-independent data • Data can then be used for different purposes

  5. SGML • SGML – Standard Generalized Markup Language • International standard in 1986 • Metalanguage (syntactic framework) for defining markup tags • Parts of SGML are rather complex • Used by large projects • Not particularly easy to get started

  6. XML • XML (Extensible Markup Language) • Adopted by World Wide Web Consortium in 1998 • Cut-down version of SGML • Based on same principles • Designed to implement easily on the Web

  7. Advantages of XML • Machine-independent plain text files (Unicode; plain ASCII is a valid subset) • Potential longevity • Multi-purpose use • Ability to analyse/manipulate content • BUT need to define tag set! • Not a replacement for HTML unless analysis/manipulation of data is required • However, XHTML has become a ‘reliable’ alternative option for simple web publishing

  8. Defining Your Own Tags • Need to undertake document analysis • Identify key features in document • Identify structure of document • Choose names for tags • Only then can we apply the tag scheme

  9. Example of a Newspaper • Name of newspaper • Issue • Article • Headline • Author • Paragraphs • Pictures
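
The newspaper features listed on this slide could be marked up roughly as follows. This is a minimal sketch: the tag names (`newspaper`, `name`, `issue`, `article`, `headline`, `author`, `para`, `picture`), the attribute names and the sample content are all illustrative assumptions, not a tag set given in the lecture. Python's standard `xml.etree.ElementTree` module is used only to show that descriptively marked-up text can then be searched by feature.

```python
import xml.etree.ElementTree as ET

# Hypothetical tag set for the newspaper example; all names are illustrative.
NEWSPAPER_XML = """\
<newspaper>
  <name>The Daily Example</name>
  <issue date="2008-01-01">
    <article>
      <headline>Library adopts XML</headline>
      <author>A. Writer</author>
      <para>First paragraph of the article.</para>
      <para>Second paragraph of the article.</para>
      <picture file="library.jpg"/>
    </article>
  </issue>
</newspaper>
"""

root = ET.fromstring(NEWSPAPER_XML)

# The markup says what each feature is, so the file can be queried by feature.
headlines = [h.text for h in root.iter("headline")]
paragraph_count = len(root.findall("./issue/article/para"))

print(headlines)
print(paragraph_count)
```

The same file could equally be fed to a stylesheet for display, which is the sense in which descriptive markup is multi-purpose.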

  10. Basics of XML Syntax • Documents are composed of elements • Start and end tags for every element: unlike HTML, end tags must be present • Also “empty elements”, which have no content • Attributes modify an element and have a name and a value • Value must be enclosed in matching quotes (single or double) • An element may have several attributes • Documents can be “Well-formed” or “Valid”
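
The syntax rules on this slide can be seen in a small fragment. The element and attribute names below (`article`, `headline`, `picture`, `id`, `section`, `file`) are assumptions made up for illustration; the parsing calls are from Python's standard `xml.etree.ElementTree` module.

```python
import xml.etree.ElementTree as ET

# Illustrative fragment: two attributes on one element (one value in double
# quotes, one in single quotes), and an empty element written as <picture .../>.
FRAGMENT = """\
<article id="a1" section='news'>
  <headline>Library adopts XML</headline>
  <picture file="library.jpg"/>
</article>
"""

article = ET.fromstring(FRAGMENT)

# Attributes are name/value pairs attached to the start tag.
print(article.attrib)

# An empty element has no text and no child elements, but may carry attributes.
picture = article.find("picture")
print(picture.text, len(picture), picture.get("file"))
```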

  11. Well-formed Documents • Well-formed documents follow XML syntax, i.e. start and end tags, attributes in quotes, nested structure • But they have no pre-defined structure! • Therefore: can only check the syntax; cannot validate the structure of well-formed documents • Prepares documents for potential use/conversion
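
A well-formedness check amounts to asking whether a parser accepts the text. The sketch below assumes Python's standard `xml.etree.ElementTree` parser; the helper name `is_well_formed` and the sample strings are hypothetical.

```python
import xml.etree.ElementTree as ET


def is_well_formed(text: str) -> bool:
    """Return True if the text parses as XML, i.e. it obeys the syntax rules."""
    try:
        ET.fromstring(text)
        return True
    except ET.ParseError:
        return False


good = '<article id="a1"><headline>News</headline></article>'
missing_end_tag = '<article id="a1"><headline>News</article>'
unquoted_attribute = '<article id=a1><headline>News</headline></article>'
bad_nesting = '<article><headline><b>News</headline></b></article>'

# Syntactically fine, although the structure makes little sense:
# a syntax check alone cannot object to it.
odd_but_well_formed = '<headline><newspaper/></headline>'

for sample in (good, missing_end_tag, unquoted_attribute, bad_nesting,
               odd_but_well_formed):
    print(is_well_formed(sample), sample)
```

The last sample illustrates the slide's point: only the syntax is checked, so an implausible structure still passes.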

  12. Valid Documents • A Valid XML document contains (or refers to) a Document Type Definition (DTD) and conforms to it • The DTD is a specification of the document structure, identifying: which elements are allowed; where they are allowed; which attributes they may take
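
To make the three kinds of rule concrete, the sketch below first shows what a DTD for a hypothetical `article` tag set might look like, then checks documents against a simplified stand-in for it. Python's standard library does not validate against DTDs, so the rules are re-expressed as plain dictionaries; this ignores element order and occurrence counts, and is not real DTD validation (in practice a validating parser would be used). All element, attribute and function names are assumptions for illustration.

```python
import xml.etree.ElementTree as ET

# What a DTD for the hypothetical tag set might look like (shown only for
# comparison; it is not parsed or enforced by the code below).
EXAMPLE_DTD = """\
<!ELEMENT article (headline, author?, para+, picture*)>
<!ELEMENT headline (#PCDATA)>
<!ELEMENT author (#PCDATA)>
<!ELEMENT para (#PCDATA)>
<!ELEMENT picture EMPTY>
<!ATTLIST article id ID #REQUIRED>
<!ATTLIST picture file CDATA #REQUIRED>
"""

# Simplified stand-in: which elements exist, and which children each may hold.
ALLOWED_CHILDREN = {
    "article": {"headline", "author", "para", "picture"},
    "headline": set(),
    "author": set(),
    "para": set(),
    "picture": set(),
}
# Which attributes each element may take.
ALLOWED_ATTRIBUTES = {"article": {"id"}, "picture": {"file"}}


def structure_errors(element):
    """Collect rule violations for an element and all of its descendants."""
    if element.tag not in ALLOWED_CHILDREN:
        return [f"unknown element <{element.tag}>"]
    errors = []
    for name in element.attrib:
        if name not in ALLOWED_ATTRIBUTES.get(element.tag, set()):
            errors.append(f"attribute '{name}' not allowed on <{element.tag}>")
    for child in element:
        if child.tag not in ALLOWED_CHILDREN[element.tag]:
            errors.append(f"<{child.tag}> not allowed inside <{element.tag}>")
        else:
            errors.extend(structure_errors(child))
    return errors


# Both documents are well-formed; only the first follows the structure rules.
ok = ET.fromstring(
    '<article id="a1"><headline>News</headline><para>Text</para></article>')
bad = ET.fromstring(
    '<article colour="red"><para><headline>News</headline></para></article>')

print(structure_errors(ok))
print(structure_errors(bad))
```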

  13. Related technologies • CSS (Cascading Style Sheets): as used with HTML; concentrates only on appearance • XHTML: version of HTML conformant with XML syntax • XSL (eXtensible Stylesheet Language): XML language for style sheets; controls the appearance of the elements within the document and defines templates for processing elements • XML Schemas: another way of defining document structure

  14. That’s all folks… • Any questions? • Optional XML exercise is available…anyone? • Otherwise – carry on with your coursework • Next Tuesday: Website management and last chance to finish off your website! …and have a nice weekend 
