{"id":13056,"date":"2021-09-16T15:10:46","date_gmt":"2021-09-16T13:10:46","guid":{"rendered":"https:\/\/dans.knaw.nl\/file-formats\/markup-language\/"},"modified":"2025-01-24T10:36:46","modified_gmt":"2025-01-24T09:36:46","slug":"markup-language","status":"publish","type":"page","link":"https:\/\/dans.knaw.nl\/en\/file-formats\/markup-language\/","title":{"rendered":"Markup language"},"content":{"rendered":"<p>[et_pb_section fb_built=&#8221;1&#8243; _builder_version=&#8221;4.10.7&#8243; _module_preset=&#8221;default&#8221;][et_pb_row _builder_version=&#8221;4.10.7&#8243; _module_preset=&#8221;default&#8221;][et_pb_column _builder_version=&#8221;4.10.7&#8243; _module_preset=&#8221;default&#8221; type=&#8221;4_4&#8243;][et_pb_text _builder_version=&#8221;4.10.7&#8243; _module_preset=&#8221;default&#8221; hover_enabled=&#8221;0&#8243; sticky_enabled=&#8221;0&#8243;]<\/p>\n<h1><span style=\"font-weight: 400;\">Markup language<\/span><span style=\"font-weight: 400;\"><\/span><\/h1>\n<p><span style=\"font-weight: 400;\">Standardized General Markup Language (SGML) and Extensible Markup Language (XML) are markup languages used for text documents and datasets, both to present them to people and to enable data exchange between computers.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">XML is a variant of SGML: all XML files are SGML files. Since XML has a much stricter syntax, it is easier to validate. HTML (Hypertext Markup Language) is another variant of SGML; it is primarily intended for the presentation of rich text (and layout) and hyperlinks to other documents.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">In addition to \u201cregular\u201d HTML there is also XHTML, which is HTML under the stricter rules of XML.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">SGML and XML are hardly being further developed. HTML has recently seen its latest version 5 officially recognized as W3C standard. As Web technology continues to develop, it is expected that HTML will continue to be developed further.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">XML, HTML and SGML are common and suitable markup language formats, provided the file formats are valid and complete (see paragraph below). Apart from these formats there are XML-based or SGML-based formats that can only be read by special software. Such files cannot be accepted without further verification; please check with DANS.<\/span><\/p>\n<h3><b>Validity<\/b><\/h3>\n<p><span style=\"font-weight: 400;\">Valid markup language documents are both well-formed and comply with the rules that apply to the file formats.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">Well-formed documents require that the content is defined in a particular manner. Well-formed XML complies with syntax rules that state, among other things, that the character set used is also the character set specified; that no prohibited characters are used in the file; that there is one root tag and that each &lt;tag&gt; is correctly terminated with a &lt;\/tag&gt;.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">The rules governing the content of a markup document are described in a DTD (Document Type Definition) or (XML) schema file. At the top of XML and HTML documents there is a reference to the DTD or schema used. This reference should really lead to the schema file itself. Ideally, the schema should be attached, unless it is available at a reliable public service.<\/span><span style=\"font-weight: 400;\"><br \/><\/span><span style=\"font-weight: 400;\">If a non-standard schema or DTD file is used, the data depositor should consult DANS beforehand.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">Through schemas and DTDs, entirely new \u201cfile formats\u201d can be defined, such as SVG (Scalable Vector Graphics, for vector images), TEI (Text Encoding Initiative, used to format and annotate text), and MathML (for mathematical formulas).<\/span><span style=\"font-weight: 400;\"><br \/><\/span><span style=\"font-weight: 400;\">The World Wide Web Consortium (W3C) manages the specifications for HTML and XML, and provides a <\/span><a href=\"https:\/\/validator.w3.org\/\"><span style=\"font-weight: 400;\">Markup Validator<\/span><\/a><span style=\"font-weight: 400;\"> that can validate both XHTML and HTML. In addition, it can validate a number of other formats, such as MathML and SMIL.<\/span><\/p>\n<h3><b>Completeness<\/b><\/h3>\n<p><span style=\"font-weight: 400;\">Markup language may be based on the use of other file formats, either in separate files or within one file. All files associated with an XML\/HTML\/SGML file must be included. Common markup language related files are XLST stylesheets, CSS definition files and JS\/ES scripting languages, see related files below.<\/span><\/p>\n<h3><b>Preferred formats\u00a0<\/b><\/h3>\n<ul>\n<li style=\"font-weight: 400;\" aria-level=\"1\"><a href=\"https:\/\/dans.knaw.nl\/en\/file-formats\/markup-language\/xml\/\"><span style=\"font-weight: 400;\">XML (.xml)<\/span><\/a><\/li>\n<li style=\"font-weight: 400;\" aria-level=\"1\"><a href=\"https:\/\/dans.knaw.nl\/en\/file-formats\/markup-language\/html\/\"><span style=\"font-weight: 400;\">HTML (.html)<\/span><\/a><\/li>\n<li style=\"font-weight: 400;\" aria-level=\"1\"><span style=\"font-weight: 400;\">Related files: <\/span><a href=\"https:\/\/dans.knaw.nl\/en\/file-formats\/markup-language\/css\/\"><span style=\"font-weight: 400;\">.css<\/span><\/a><span style=\"font-weight: 400;\">, <\/span><a href=\"https:\/\/dans.knaw.nl\/en\/file-formats\/markup-language\/xslt\/\"><span style=\"font-weight: 400;\">.xslt<\/span><\/a><span style=\"font-weight: 400;\">, <\/span><a href=\"https:\/\/dans.knaw.nl\/en\/file-formats\/markup-language\/script\/\"><span style=\"font-weight: 400;\">.js<\/span><\/a><span style=\"font-weight: 400;\">, <\/span><a href=\"https:\/\/dans.knaw.nl\/en\/file-formats\/markup-language\/script\/\"><span style=\"font-weight: 400;\">.es<\/span><\/a><\/li>\n<li style=\"font-weight: 400;\" aria-level=\"1\"><a href=\"https:\/\/dans.knaw.nl\/en\/file-formats\/markup-language\/markdown\/\"><span style=\"font-weight: 400;\">Markdown (.md)<\/span><\/a><\/li>\n<\/ul>\n<h3><b>Non-preferred formats\u00a0<\/b><\/h3>\n<ul>\n<li style=\"font-weight: 400;\" aria-level=\"1\"><a href=\"https:\/\/dans.knaw.nl\/en\/file-formats\/markup-language\/sgml\/\"><span style=\"font-weight: 400;\">SGML (.sgml)<\/span><\/a><\/li>\n<\/ul>\n<p><span style=\"font-weight: 400;\">\u00a9 DANS. R.5.5.T.3 Version 1.0, January 17, 2025<\/span><\/p>\n<p>[\/et_pb_text][\/et_pb_column][\/et_pb_row][\/et_pb_section]<\/p>\n","protected":false},"excerpt":{"rendered":"<p>Markup language Standardized General Markup Language (SGML) and Extensible Markup Language (XML) are markup languages used for text documents and datasets, both to present them to people and to enable data exchange between computers. XML is a variant of SGML: all XML files are SGML files. Since XML has a much stricter syntax, it is [&hellip;]<\/p>\n","protected":false},"author":8,"featured_media":0,"parent":12590,"menu_order":0,"comment_status":"closed","ping_status":"closed","template":"","meta":{"_acf_changed":false,"_et_pb_use_builder":"on","_et_pb_old_content":"","_et_gb_content_width":"","inline_featured_image":false,"footnotes":""},"class_list":["post-13056","page","type-page","status-publish","hentry"],"acf":[],"yoast_head":"<!-- This site is optimized with the Yoast SEO plugin v27.4 - https:\/\/yoast.com\/product\/yoast-seo-wordpress\/ -->\n<title>File formats: Markup language | DANS<\/title>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/dans.knaw.nl\/en\/file-formats\/markup-language\/\" \/>\n<meta property=\"og:locale\" content=\"en_US\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"File formats: Markup language | DANS\" \/>\n<meta property=\"og:description\" content=\"Markup language Standardized General Markup Language (SGML) and Extensible Markup Language (XML) are markup languages used for text documents and datasets, both to present them to people and to enable data exchange between computers. XML is a variant of SGML: all XML files are SGML files. Since XML has a much stricter syntax, it is [&hellip;]\" \/>\n<meta property=\"og:site_name\" content=\"DANS\" \/>\n<meta property=\"article:publisher\" content=\"https:\/\/www.facebook.com\/Data-Archiving-and-Networked-Services-DANS-140231255994524\/\" \/>\n<meta property=\"article:modified_time\" content=\"2025-01-24T09:36:46+00:00\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:site\" content=\"@dans_knaw_nwo\" \/>\n<meta name=\"twitter:label1\" content=\"Est. reading time\" \/>\n\t<meta name=\"twitter:data1\" content=\"3 minutes\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\\\/\\\/schema.org\",\"@graph\":[{\"@type\":\"WebPage\",\"@id\":\"https:\\\/\\\/dans.knaw.nl\\\/en\\\/file-formats\\\/markup-language\\\/\",\"url\":\"https:\\\/\\\/dans.knaw.nl\\\/en\\\/file-formats\\\/markup-language\\\/\",\"name\":\"File formats: Markup language | DANS\",\"isPartOf\":{\"@id\":\"https:\\\/\\\/dans.knaw.nl\\\/en\\\/#website\"},\"datePublished\":\"2021-09-16T13:10:46+00:00\",\"dateModified\":\"2025-01-24T09:36:46+00:00\",\"breadcrumb\":{\"@id\":\"https:\\\/\\\/dans.knaw.nl\\\/en\\\/file-formats\\\/markup-language\\\/#breadcrumb\"},\"inLanguage\":\"en-US\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\\\/\\\/dans.knaw.nl\\\/en\\\/file-formats\\\/markup-language\\\/\"]}]},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\\\/\\\/dans.knaw.nl\\\/en\\\/file-formats\\\/markup-language\\\/#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"Home\",\"item\":\"https:\\\/\\\/dans.knaw.nl\\\/en\\\/\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"File formats\",\"item\":\"https:\\\/\\\/dans.knaw.nl\\\/en\\\/file-formats\\\/\"},{\"@type\":\"ListItem\",\"position\":3,\"name\":\"Markup language\"}]},{\"@type\":\"WebSite\",\"@id\":\"https:\\\/\\\/dans.knaw.nl\\\/en\\\/#website\",\"url\":\"https:\\\/\\\/dans.knaw.nl\\\/en\\\/\",\"name\":\"DANS\",\"description\":\"\",\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\\\/\\\/dans.knaw.nl\\\/en\\\/?s={search_term_string}\"},\"query-input\":{\"@type\":\"PropertyValueSpecification\",\"valueRequired\":true,\"valueName\":\"search_term_string\"}}],\"inLanguage\":\"en-US\"}]}<\/script>\n<!-- \/ Yoast SEO plugin. -->","yoast_head_json":{"title":"File formats: Markup language | DANS","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/dans.knaw.nl\/en\/file-formats\/markup-language\/","og_locale":"en_US","og_type":"article","og_title":"File formats: Markup language | DANS","og_description":"Markup language Standardized General Markup Language (SGML) and Extensible Markup Language (XML) are markup languages used for text documents and datasets, both to present them to people and to enable data exchange between computers. XML is a variant of SGML: all XML files are SGML files. Since XML has a much stricter syntax, it is [&hellip;]","og_site_name":"DANS","article_publisher":"https:\/\/www.facebook.com\/Data-Archiving-and-Networked-Services-DANS-140231255994524\/","article_modified_time":"2025-01-24T09:36:46+00:00","twitter_card":"summary_large_image","twitter_site":"@dans_knaw_nwo","twitter_misc":{"Est. reading time":"3 minutes"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"WebPage","@id":"https:\/\/dans.knaw.nl\/en\/file-formats\/markup-language\/","url":"https:\/\/dans.knaw.nl\/en\/file-formats\/markup-language\/","name":"File formats: Markup language | DANS","isPartOf":{"@id":"https:\/\/dans.knaw.nl\/en\/#website"},"datePublished":"2021-09-16T13:10:46+00:00","dateModified":"2025-01-24T09:36:46+00:00","breadcrumb":{"@id":"https:\/\/dans.knaw.nl\/en\/file-formats\/markup-language\/#breadcrumb"},"inLanguage":"en-US","potentialAction":[{"@type":"ReadAction","target":["https:\/\/dans.knaw.nl\/en\/file-formats\/markup-language\/"]}]},{"@type":"BreadcrumbList","@id":"https:\/\/dans.knaw.nl\/en\/file-formats\/markup-language\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Home","item":"https:\/\/dans.knaw.nl\/en\/"},{"@type":"ListItem","position":2,"name":"File formats","item":"https:\/\/dans.knaw.nl\/en\/file-formats\/"},{"@type":"ListItem","position":3,"name":"Markup language"}]},{"@type":"WebSite","@id":"https:\/\/dans.knaw.nl\/en\/#website","url":"https:\/\/dans.knaw.nl\/en\/","name":"DANS","description":"","potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/dans.knaw.nl\/en\/?s={search_term_string}"},"query-input":{"@type":"PropertyValueSpecification","valueRequired":true,"valueName":"search_term_string"}}],"inLanguage":"en-US"}]}},"_links":{"self":[{"href":"https:\/\/dans.knaw.nl\/en\/wp-json\/wp\/v2\/pages\/13056","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/dans.knaw.nl\/en\/wp-json\/wp\/v2\/pages"}],"about":[{"href":"https:\/\/dans.knaw.nl\/en\/wp-json\/wp\/v2\/types\/page"}],"author":[{"embeddable":true,"href":"https:\/\/dans.knaw.nl\/en\/wp-json\/wp\/v2\/users\/8"}],"replies":[{"embeddable":true,"href":"https:\/\/dans.knaw.nl\/en\/wp-json\/wp\/v2\/comments?post=13056"}],"version-history":[{"count":1,"href":"https:\/\/dans.knaw.nl\/en\/wp-json\/wp\/v2\/pages\/13056\/revisions"}],"predecessor-version":[{"id":23176,"href":"https:\/\/dans.knaw.nl\/en\/wp-json\/wp\/v2\/pages\/13056\/revisions\/23176"}],"up":[{"embeddable":true,"href":"https:\/\/dans.knaw.nl\/en\/wp-json\/wp\/v2\/pages\/12590"}],"wp:attachment":[{"href":"https:\/\/dans.knaw.nl\/en\/wp-json\/wp\/v2\/media?parent=13056"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}