HTML/Related Articles

From Citizendium
< HTML
Revision as of 20:09, 22 September 2010 by imported>Pat Palmer (→‎Other related topics)
Jump to navigation Jump to search
This article is developing and not approved.
Main Article
Discussion
Related Articles  [?]
Bibliography  [?]
External Links  [?]
Citable Version  [?]
 
A list of Citizendium articles, and planned articles, about HTML.
See also changes related to HTML, or pages that link to HTML or to this page or whose text contains "HTML".

Parent topics

  • SGML [r]: Standard Graphical Markup Language. A universal standard used by designers and printers for page layout. [e]
  • Page layout [r]: (Noun) The configuration of headings, body text and other items that make up a printed page. (Verb) Also, the actual process of designing and putting in place those elements. [e]

Subtopics

Other related topics

  • Adobe Flash [r]: Extremely popular multimedia authoring and playback system from Adobe, where flash formats are used for most of the animated ads and video clips on today's Web sites. [e]
  • Ajax (web technology) [r]: JavaScript programming technique to communicate with the server without reloading the webpage. [e]
  • Cascading Style Sheets [r]: A format designed by the W3C for describing the presentation, layout and other design choices of a document on the Web. [e]
  • Comparison of Java and .NET [r]: Add brief definition or description
  • Google Web Toolkit [r]: Open source web framework written in Java. [e]
  • Gopher (protocol) [r]: A deprecated search and retrieval network protocol which ran on the Internet, which saw much of its heyday in the 1980s before the World Wide Web became popular [e]
  • HTML5 [r]: HTML5 is the next generation hypertext markup language standard published by the World Wide Web Consortium to provide new ways of presenting content on the World Wide Web that include transition effects, animation, video, and more. [e]
  • HTTP [r]: Network protocol on which the World Wide Web is based. [e]
  • Internet Engineering Task Force [r]: Internet standards body that operates on a consensus-based model. [e]
  • Java platform [r]: A bunch of programs needed for creating and running programs written in the Java programming language. [e]
  • JavaScript [r]: General-purpose computer programming language that is frequently embedded within HTML pages on the World Wide Web to make pages more interactive. [e]
  • Mashup [r]: A data visualization created by combining data with multiple computer applications. [e]
  • Metadata [r]: Data that describes, or is about, other data. [e]
  • .NET Framework [r]: Bundle of interdependent programs required to create and run programs using C# and multiple other programming languages on most versions of Microsoft Windows. [e]
  • Tim Berners-Lee [r]: British software developer famous for creating the World Wide Web. [e]
  • Web browser [r]: A computer program that retrieves and renders webpages to display information stored on a web server. [e]
  • Web server [r]: Add brief definition or description
  • XHTML [r]: a form of web page markup language which is similar to HTML but adheres to stricter syntax rules, being based on XML [e]
  • XML [r]:
This article is developing and not approved.
Main Article
Discussion
Related Articles  [?]
Bibliography  [?]
External Links  [?]
Citable Version  [?]
 
A list of Citizendium articles, and planned articles, about HTML.
See also changes related to HTML, or pages that link to HTML or to this page or whose text contains "HTML".

XML (eXtensible Markup Language) is a platform-independent, human-readable format for representing data. It was released as a W3C Recommendation in November 1998 and, perhaps because of its platform independence and readability both by humans and software, obtained immediate acceptance. Today, XML is in widespread use across the World Wide Web, especially for sending data between computers and for serializing data to disk, and most computer programming languages include support for it. XML is one of several wire formats used in Ajax (Asynchronous Javascript And Xml), the background interaction of a web page with programs residing elsewhere on a network.

XML Specification and Origin

XML is a subset of the Standard Generalized Markup Language, or SGML (ISO8879-1986). XML's specification first emerged in 1996 through the efforts of the XML Special Interest Group and the SGML Editorial Review Board, chaired by John Bosak of Sun Microsystems. The group, also known as the XML Working Group, laid out the following set of guidelines or design goals for XML:

  1. XML shall be straightforwardly usable over the Internet.
  2. XML shall support a wide variety of applications.
  3. XML shall be compatible with SGML.
  4. It shall be easy to write programs which process XML documents.
  5. The number of optional features in XML is to be kept to the absolute minimum, ideally zero.
  6. XML documents should be human-legible and reasonably clear.
  7. The XML design should be prepared quickly.
  8. The design of XML shall be formal and concise.
  9. XML documents shall be easy to create.
  10. Terseness in XML markup is of minimal importance.

[1]

Notably missing from the above list is efficiency. XML is plain text and can be verbose, and in applications where speed is everything, a different data representation may be preferred.

Grammar and Definitions

This section contains a brief over view of the components that define XML, for a more in-depth examination of those components read the individual sections under the heading Structure


Prolog - A prolog is a sort of 'announcement' to the XML parser as to what version the following set of XML objects is written in. The prolog also contains other information pertinent to the XML document (discussed later.)

<?xml version="1.0" standalone="no" ?>


Tag - A tag denotes the beginning, end, or existence of an XML object. They consist of the < character followed by the name of the tag, and, in the case of an opening tag, just a > at the end. The exception is with end tags and single tags. End Tags always have a < then a forward slash before the name of the tag with a > at the end. Single tags always have a space after the name of the tag followed by a forward slash and then end with the >.

Example

<ninja>
    <inventory>
        <nunchucks />
        <cookies />
    </inventory>
</ninja>


Element - An element is an XML object composed of a start tag and a stop tag or a single tag.

Example

<?xml version="1.0" standalone="no" ?>
<family>
    <member>
        <name>dad</name>
        <favorite-food>pancakes</favorite-food>
        <favorite-animal>wombats</favorite-animal>
    </member>
</family>

Document - A document is any XML object that contains a prolog and one or more elements excluding the root element.

<?xml version="1.0" standalone="no" ?>
<root-element>
    <lots-of-elements>
    ...
    </lots-of-elements>
</root-element>

Structure

Prolog

The XML prolog is a mandatory object present at the beginning of the document.

<?xml version="###" {encoding="???" standalone='y/n'} ?>

version equals the version of XML the document was written to.

(Optional) encoding equals the character codebook the document was written to. The default is UTF-8. The character encoding declaration must be written using latin characters only.[2]


EncName    ::=    [A-Za-z] ([A-Za-z0-9._] | '-')* 

The encoding name must begin with an alphabetic character and all other characters must be alphanumeric, the underscore ( _ ), the decimal ( . ), or a dash ( - ).


(Optional) standalone is either yes or no. Yes in the instance that the XML document does not have an external DTD or Schema. No if the XML document does have an external DTD or Schema.

Elements

An element is an XML object defined by the accompanying DTD or Schema that is described by the use of tags, parsed text, attributes, and entities. An element may consist of a start tag and a end tag or a single tag.

<?xml version="1.0" standalone="no" ?>
<army>
    <ninja name="woody">
        <inventory>
            <nunchucks count="2" />
            <cookies count="15" />
        </inventory>
    </ninja>
</army>

In this case, <army> represents a root element that contains a series of elements, in this example the element shown is a <ninja>. A ninja has an inventory that contains a certain number of nunchucks and a certain number of cookies. The number of cookies and nunchucks are defined by the attribute count. Note, the attribute values are in quotation marks, all well formed XML documents have their attributes quoted in that fashion either with single quotes ' or double quotes ".

References

  1. Bray, Tim, Jean Paoli, C. M. Sperberg-McQueen, Eve Maler, and François Yergeau, eds. "Extensible Markup Language (XML) 1.0 (Fourth Edition)." World Wide Web Consortium Recommendations. 29 Sept. 2006. 18 May 2007 <http://www.w3.org/TR/2006/REC-xml-20060816/#sec-origin-goals>.
  2. Bray, Tim, Jean Paoli, C. M. Sperberg-McQueen, Eve Maler, and François Yergeau, eds. "Extensible Markup Language (XML) 1.0 (Fourth Edition)." World Wide Web Consortium Recommendations. 29 Sept. 2006. 18 May 2007 <http://www.w3.org/TR/2004/REC-xml-20040204/#NT-EncodingDecl> §4.3.3 Character Encoding in Entities.

External links

Related Technologies

Bot-suggested topics

Auto-populated based on Special:WhatLinksHere/HTML. Needs checking by a human.