Markup UK 2018 Proceedings

Table of Contents

Shared Tag Sets as Social Constructs
From the Mouth of a Novice
The Bare Bones XML specification
Advantages of Shared Vocabularies
Of course All is not sweetness and light
Mixing Vocabularies (by cut and paste not namespaces)
Natural/Intuitive Tagging (Influences of Shared Vocabularies on Thinking)
Shared Wisdom: the culture and practice of XML
Tag Set Cultures
Different use cases make for different cultures
Community coercion: All is still not sweetness and light
But ...
How to Make a Flying Start with S1000D – Lessons Learned at the Airport
Introduction to S1000D
What is S1000D?
Principal Concepts
Data Modules
Publication Modules
Common Source Database
Introduction to the Knowledge Warehouse Project
Migration Planning
Data Modules
The Data Module Code
Model Identification Code
System Difference Code
Standard Numbering System
Other Codes
Putting It All Together
Data Module Title
Publication Modules
The Publication Module Code
Model Identification Code
Issuing Authority
Publication Number
Volume Number
Putting It All Together
Publication Module Entry Title
Business Rules
Business Rule Decision Points
Business Rules Exchange (BREX)
Context Rules
Context Independent Rules
SNS Rules
Lessons Learned
Spreadsheet It!
Hearts and Minds
Two (and a half) models for markup of bibliographic references
Bibliographic reference formats
Online functionality of bibliographic references
OUP data model
BITS data model
Converting the OUP data model to BITS
Outline of the XSLT
Limitations of the conversion
The Cryptic Crossword Corpus Project: first steps in establishing a markup vocabulary
A brief explanation of cryptic crosswords
Developing the markup vocabulary for CCCP
Modern amendment drafting - The road to XML
Validating amendments
Ordering amendments
Problem definitions
Problem 1: Segment amendments in an amendments document
Problem 2: Recognize the location, action and operand information
Problem 3: Order the amendments according to the rule set
Problem 4: Generate the simulation
Segmenting amendments (Problem #1)
Linear Chain Conditional Random Fields (CRF)
Model description
Recognizing location, action and operand information (Problem #2)
Validating amendments
Ordering the amendments (Problem #3)
Simulating the effect (Problem #4)
Human in the loop
Conclusions and further work
Introduction to CSS for Paged Media
Web and Paged Media
@media Rule
Specifying a Print Style Sheet
<style> Element
@import Rule
Media attribute of <style> and <link> elements
Differences Between Screen and Paged Media
Design approach
Left and right pages
The printed book
Page Setting
Western Page Design
Japanese Page Design
@page Rule
Named Page : page property
Crop and Registration Marks
Page bleed area
Headers and Footers
Margin Boxes
Running Headers and Page Numbers
Running header setting : string-set property and string() function
Variable strings : string-set property
Move elements to header/footer : running() position value
Insert a running element: element()
Page number : counter(page)
Total pages : counter(pages)
Left and Right Page Headers: :left and :right
Last and only pages: :last and :only
PDF Output
PDF versions
Tagged PDF
Matterhorn Protocol
PAC 3 PDF/UA checker
Document properties
Extensible Metadata Platform (XMP)
Colour Specification
Printing colour
Text Colour : color property
CMYK colours
PANTONE® spot colours
Other spot colours
Numbering Chapters and Sections
Inserting Characters : content property
Incrementing Counters : counter-increment property
Counter Reset : counter-reset property
Page counter
Counter styles
Defining Custom Counter Styles : @counter-style rule
Predefined Counter Styles
The Wolfenbüttel emblem2rdf Pipeline
Emblematica Online – Linked Open Emblem Data
Encoding emblems
The publication process as XProc pipeline
Data entry and initial transformation
TEI to Emblem Schema
Emblem Schema documents
Running the emblem2rdf pipeline
CREPDL: Protect Yourself from the Proliferation of Unicode Characters
Subsets in Unicode
Subsets in ISO/IEC 10646
Code Points and Ranges
Open Collections and Fixed Collections
References to Other Collections
Grapheme Clusters
User-defined Subsets
Existing Machine-readable Notations for Describing Subsets
Unicode Regular Expressions
Referencing other subsets
A Notation for Character Collections for the WWW
Design and Implementation of CREPDL
Language Design
Concluding Remarks and Future Works
Rethinking transformation – the potential of code generation
Rethinking document-to-document transformation
Source navigation based model
Basic concepts
Primitive operations
Composing document-to-document transformation
Metadata model SNAT
Metadata item model
Advanced features
Value mappings
User-defined functions
User-defined variables
Code generator SNAT
Source alignment based model
What is an alignment?
Representation versus information
Annotated target tree (SAAT)
Mapping alignments to navigation
Inferring context propagation
Semantic versus structural relationships
Future work
Metadata model SAAT (Source Alignment Annotated Target tree)
Minimal SAAT
Alignment qualifiers
Code generator SAAT
AT Map Machine
Does SNAT presuppose XML?
Does SAAT presuppose XML?
Scheme for building SNAT-based code generators
Metadata item model
Metadata value model
Code assembly model
Using SNAT as a meta model
Proof of concept: RDF-to-XML, SNAT-based
Metadata item model
Metadata value model
Code assembly model
Example SNAT and source code
Non-XML workflows with XProc 3.0
Reprise: Non-XML documents in XProc 1.0
XProc 3.0's new concept of a document
Applying the model
Lightweight XML DevOps using Apache Ant
Development Policy
The ANT macro library
Example of use
Macro construction
XPantS contents
Build file example
An XSD 1.1 Schema Validator Written in XSLT 3.0
The Validation Task
Design Considerations
Generic Stylesheet or Generated Stylesheet?
Subset of XSLT 3.0
Typed data
Support for xsi:schemaLocation
Use of Maps for Returned Values
Declaring Map Types
Assessment against Complex Types using Finite State Machines
Checking Assertions
Other Complications
When Overlapping XML Meets Changing XML Does Confusion Reign?
Introduction and Background
How Content Duplication Represents Any Change
Representing Structural Change without Content Duplication
Dominant Hierarchy
Processing Observations
The XForms 2.0 Test Suite
Introduction to XForms
Test Suites
The XForms 1.* test suite
The XForms 2.0 test suite
The Generic Structure of the Tests

List of Figures

1. Validation warnings
2. Simulation result
1. Effect of show-through with non-aligned and aligned text
2. Effect of binding method
3. Page and Margin Area
4. Traditional Western page design
5. page property selects named page
6. Crop mark terms
7. Page bleed
8. Location of each margin box
9. Three pages with different headers and footers
10. Matterhorn Protocol failure conditions for tables
11. PAC 3 PDF/UA checker
12. Document properties shown by Acrobat Reader
13. XMP properties viewed in Acrobat
14. RGB and CMY
15. sRGB and CMYK gamuts
16. Greyscale levels on the separation correspond to the level of tint to apply
1. Emblem E018850 encoded in TEI
2. The same emblem E018850 encoded as Emblem Schema
1. Pseudo code of get-local-instantiation, a function implementing local instantiation. An auxiliary function new-node is assumed to be available, which constructs a node, given the node kind, the node name and the node contents.

List of Tables

1. S1000D Data Module Schemas
2. A word of advice..
3. A word of advice..
4. A word of advice..
5. A word of advice..
6. Data Module Code
7. A word of advice..
8. A word of advice..
9. A word of advice..
10. Baseline BRDP for the Knowledge Warehouse project
11. Other important BRDP for the Knowledge Warehouse project
12. BRDP that may have been beneficial to consider for Knowledge Warehouse but were not
1. Evaluation results
1. PDF specification versions
1. Code snippets taken from the example of a simple pull transformation.
2. Some concepts facilitating the decomposition of document to document transformation into primitive operations.
3. Primitive operations serving as building blocks of the local instantiation of a model node.
4. Metadata items SNAT
5. The assembly of primitive operations
6. A comparison between navigation steps and alignments.
7. Application of the “shortest-path-principle”.
8. Examples for a possible generalization of the node concept, meaning distinct items of which the resource is composed.
1. The structure of maps used to return partial results of processing

List of Examples

1. Book, Chicago style
2. Book, Harvard style
3. Book, Vancouver style
4. Journal article, Chicago style
5. Journal article, Harvard style
6. Journal article, Vancouver style
7. Journal article, style rendered by OUP Academic platform
8. OpenURL search
9. COPAC search
10. Google Preview
11. WorldCat search
12. OUP data model for a book reference
13. OUP data model for a journal article reference
14. BITS data model for a book reference
15. BITS data model for a journal article reference
16. Non-standard references
17. Stages of conversion
1. XQuery code generated for XML data source
2. SNAT document, defining the transformation described as an introductory example.
3. XQuery code, generated from the SNAT tree shown in the preceding listing.
4. Examples of SPARQL queries generated for RPath expressions.
5. SNAT document, defining the transformation of RDF data into an XML document.
6. XQuery code, generated from the SNAT document shown in the preceding listing.