| HTML |
|---|
An HTML element is an individual component of an HTML document. HTML documents are composed of a tree of HTML elements and other nodes, such as text nodes. Each element can have attributes specified. Elements can also have content, including other elements and text. HTML elements represent semantics, or meaning. For example, the title element represents the title of the document.
In the HTML syntax, most elements are written with a start tag and an end tag, with the content in between. Tags are composed of the name of the element, surrounded by angle brackets. An end tag also has a slash after the opening angle bracket, to distinguish it from the start tag. For example, a paragraph, which is represented by the p element, would be written as
<p>In the HTML syntax, most elements are written ...</p>
However, not all of these elements require the end tag, or even the start tag, to be present. Some elements, the so-called void elements don't have an end tag. A typical example is the br element, which represents a significant line break, such as in a poem or an address. For example, the address of the dentist in Finding Nemo would be written as
<p>P. Sherman<br>42 Wallaby Way<br>Sydney</p>
Attributes are specified on the start tag. For example, the abbr element, which represents an abbreviation, expects a title attribute with its expansion. This would be written as
<abbr title="Hyper Text Markup Language">HTML</abbr>
Contents |
There are multiple kinds of HTML elements: void elements, raw text elements, and normal elements.
Void elements only have a start tag, which contains any attributes. One example is the link element, for which the syntax is
<link rel=stylesheet href=fancy.css type="text/css">This link element points the browser at a stylesheet to use when presenting the HTML document to the user. Note that in the HTML syntax, attributes don't have to be quoted. When using the XML syntax (XHTML), on the other hand, all attributes must be quoted, and a trailing slash is required before the last angle bracket:
<link rel="stylesheet" href="fancy.css" type="text/css" />Raw text elements are constructed with:
<tag>) marking the beginning of an element, which may incorporate any number of attributes;</tag>. In some versions of HTML, the end tag is optional for some elements. The end tag is required in XHTML.Normal elements usually have both a start tag and an end tag, although for some elements the end tag, or both tags, can be omitted. It is constructed in a similar way:
<tag>) marking the beginning of an element, which may incorporate any number of attributes;</tag>.Attributes define desired behavior or indicate additional element properties. Most attributes require a value. In HTML, the value can be left unquoted if it doesn't include spaces (name=value), or it can be quoted with single or double quotes (name='value' or name="value"). In XML, those quotes are required. Boolean attributes, on the other hand, don't require a value to be specified. An example is the checked for checkboxes:
<input type=checkbox checked>In the XML syntax, though, the name should be repeated as the value:
<input type="checkbox" checked="checked"/>Informally, HTML elements are sometimes referred to as "tags" (an example of synecdoche), though many prefer the term tag strictly in reference to the markup delimiting the start and end of an element.
Element (and attribute) names may be written in any combination of upper or lower case in HTML, but must be in lower case in XHTML.[1] The canonical form was upper-case until HTML 4, and was used in HTML specifications, but in recent years, lower-case has become more common.
HTML elements are defined in a series of freely-available open standards issued since 1995, initially by the IETF and subsequently by the W3C.
Since the early 1990s, manufacturers of user agents (e.g. web browsers) have often developed their own elements, some of which have been adopted in later standards. Other user agents may not recognise non-standard elements, and they may be ignored or displayed improperly.
In 1998, XML (a simplified form of SGML) introduced mechanisms to allow anyone to develop their own elements and incorporate them in XHTML documents, for use with XML-aware user agents.[2]
Subsequently, HTML 4.01 was rewritten in an XML-compatible form, XHTML 1.0 (eXtensible HTML). The elements in each are identical, and in most cases valid XHTML 1.0 documents will be valid or nearly-valid HTML 4.01 documents. This article mainly focuses on real HTML, unless noted otherwise; however, it remains applicable to XHTML. (See HTML for a discussion of the minor differences between the two).
Since the first version of HTML, several elements have become outmoded, and are deprecated in later standards, or do not appear at all, in which case they are invalid (and will be found invalid, and perhaps not displayed, by validating user agents).[3]
At present, the status of elements is complicated by the existence of three types of HTML 4.01 / XHTML 1.0 DTD:
The first Standard (HTML 2.0) contained four deprecated elements, one of which was invalid in HTML 3.2. All four are invalid in HTML 4.01 Transitional, which also deprecated a further ten elements. All of these, plus two others, are invalid in HTML 4.01 Strict. While the frame elements are still current in the sense of being present in the Transitional and Frameset DTDs, there are no plans to preserve them in future standards, as their function has been largely replaced, and they are highly problematic for user accessibility.
(Strictly speaking, the most recent XHTML standard, XHTML 1.1 (2001), does not include frames at all; it is approximately equivalent to XHTML 1.0 Strict, but also includes the Ruby markup module.)[4]
A common source of confusion is the loose use of deprecated to refer to both deprecated and invalid status, and to elements which are expected to be formally deprecated in future.
In keeping with the principle of separation of concerns, the function of HTML is primarily to add structural and semantic information to the raw text of a document. Presentation and behaviour are separate functions, which can be added as desired, ideally through links to external documents such as stylesheets, graphics files, and scripts.
This allows the document to be presented by different user agents according to their purposes and abilities; for example, a user agent can select an appropriate stylesheet to present a document by displaying on a monitor, printing on paper, or to determine speech characteristics in an aural user agent. The structural and semantic functions of the markup remain identical in each case.
Historically, user agents did not always support these features. In the 1990s, as a stop-gap, presentational elements were added to HTML, at the cost of creating problems for interoperability and user accessibility. This is now regarded as outmoded and has been superseded by stylesheet-based design; most presentational elements are now deprecated.[5]
External image files are incorporated with the img or object elements. (With XHTML, the SVG language can also be used to write graphics within the document, though linking to external SVG files is generally simpler.)[6] Where an image is not purely decorative, HTML allows replacement content with similar semantic value to be provided for non-visual user agents.
An HTML document can also be extended through the use of scripts to provide additional behaviours beyond the abilities of HTML hyperlinks and forms.
The elements style and script, with related attributes, provide reference points in HTML markup for links to stylesheets and scripts. They can also contain instructions directly.
<html>...</html>
<head>...</head>
<body>...</body>
<base>
href and other links in the document. Must appear before any element that refers to an external resource. HTML permits only one base element for each document. The base element has attributes, but no contents.<basefont> (deprecated)
font elements. Deprecated in favor of stylesheets.<isindex> (deprecated)
isindex could either appear in the document head or in the body, but only once in a document. Requires a server-side search engine.<link>
<link rel="stylesheet" type="text/css" href="url" title="description_of_style">[8]<link rel="next" href="url">head element may contain any number of link elements. The link element has attributes, but no contents.meta elements specify associative key-value pairs.meta elements can specify HTTP headers which should be sent by a web server before the actual content, for example:<meta http-equiv="foo" content="bar">foo that has a value bar.meta element specifies name and associated content attributes describing aspects of the HTML page. To prevent possible ambiguity, an optional third attribute, scheme, may be supplied to specify a semantic framework that defines the meaning of the key and its value: for example:<meta name="foo" content="bar" scheme="DC">meta element identifies itself as containing the foo element, with a value of bar, from the DC or Dublin Core resource description framework.<object>…</object>
head element, it could potentially be used to extract foreign data and associate it with the current document.<script>…</script>
src attribute.[9] Also usable in the document body to dynamically generate either both block or inline content.<style>…</style>
<style type="text/css"> … </style>@import directives of the form:<style> @import url; </style>[10]<title>…</title>
title element must not contain other elements, only text. Only one title element is permitted in a document.In visual browsers, displayable elements can be rendered as either block or inline. While all elements are part of the document sequence, block elements appear within their parent elements:
Conversely, inline elements are treated as part of the flow of document text; they cannot have margins, width or height set, and do break across lines.
<p>…</p>
P existed in HTML Tags, and was standardised in HTML 2.0; still current.<h1>…</h1> <h2>…</h2> <h3>…</h3> <h4>…</h4> <h5>…</h5> <h6>…</h6>
<h1> delimits the highest-level heading, <h2> the next level down (sub-section), <h3> for a level below that, and so on to <h6>.<dl>…</dl>
DL existed in HTML Tags, and was standardised in HTML 2.0; still current.<dt>…</dt>
DT existed in HTML Tags, and was standardised in HTML 2.0; still current.<dd>…</dd>
DD existed in HTML Tags, and was standardised in HTML 2.0; still current.<ol>…</ol>
type attribute can be used to specify the kind of ordering, but stylesheets give more control: {list-style-type: foo}. The default is Arabic numbering.OL existed in HTML Internet Draft 1.2, and was standardised in HTML 2.0; still current.<ul>…</ul>
{list-style-type: foo}. The default marker is a disc.UL existed in HTML Tags, and was standardised in HTML 2.0; still current.<li>…</li>
LI existed in HTML Tags, and was standardised in HTML 2.0; still current.<dir>…</dir> (deprecated)
<ul>.DIR existed in HTML Tags, and was standardised in HTML 2.0; deprecated in HTML 4.0 Transitional; invalid in HTML 4.0 Strict.<menu>…</menu> (deprecated)
<ul> list, but badly supported. Deprecated in favor of <ul>.MENU existed in HTML Tags, and was standardised in HTML 2.0; deprecated in HTML 4.0 Transitional; invalid in HTML 4.0 Strict.<address>…</address>
ADDRESS existed in HTML Tags, and was standardised in HTML 2.0; still current. <blockquote>…</blockquote>
cite attribute may give the source, and must be a fully qualified Uniform Resource Identifier.q) element.BLOCKQUOTE existed in HTML Internet Draft 1.2, and was standardised in HTML 2.0; still current. See blockquote element for more information.<center>…</center> (deprecated)
<div> or another element with centering defined using stylesheets.<del>...</del>
<div>…</div>
<hr>
<ins>...</ins>
<noscript>...</noscript>
<pre>…</pre>
pre, whitespace should be rendered as authored. (With the CSS properties: {white-space: pre; font-family: monospace;}, other elements can be presented in the same way.) This element can contain any inline element except: image (IMG), object (OBJECT), big font size (BIG), small font size (SMALL), superscript (SUP), and subscript (SUB).PRE existed in HTML Internet Draft 1.2, and was standardised in HTML 2.0; still current.<script>...</script>
SCRIPT is not itself either a block or inline element; by itself it should not display at all, but it can contain instructions to dynamically generate either both block or inline content.Inline elements cannot be placed directly inside the body element; they must be wholly nested within block-level elements [11].
<a>…</a>
href (hypertext reference [1]), the anchor becomes a hyperlink to either another part of the document or another resource (e.g. a webpage) using an external URL.name or id attributes set, the element becomes a target. A Uniform Resource Locator can link to this target via a fragment identifier. Any element can now be made into an anchor by using the id attribute,[2] so using <a name="foo"> is not necessary.title may be set to give brief information about the link:<a href="URL" title="additional information">link text</a>title appears in some way (varies according to browser). Some browsers render alt text the same way, though this is technically incorrect.A existed in HTML Tags, and was standardised in HTML 2.0; still current.<abbr>…</abbr>
<abbr title="abbreviation">abbr.</abbr><acronym>…</acronym>
abbr element, but marks an acronym:<acronym title="Hyper-Text Markup Language">HTML</acronym><dfn>…</dfn>
DFN existed in HTML Internet Draft 1.2, and was fully standardised in HTML 3.2; still current.<em>…</em>
EM existed in HTML Internet Draft 1.2, and was standardised in HTML 2.0; still current.<strong>…</strong>
STRONG existed in HTML Internet Draft 1.2, and was standardised in HTML 2.0; still current.These elements are useful primarily for documenting computer code development and user interaction through differentiation of source code (<code>), source code variables (<var>), user input (<kbd>), and terminal output (<samp>).
<code>…</code>
Code snippet.CODE existed in HTML Internet Draft 1.2, and was standardised in HTML 2.0; still current.<samp>…</samp>
SAMP existed in HTML Internet Draft 1.2, and was standardised in HTML 2.0; still current.<kbd>…</kbd>
KBD existed in HTML Internet Draft 1.2, and was standardised in HTML 2.0; still current.<var>…</var>
VAR existed in HTML Internet Draft 1.2, and was standardised in HTML 2.0; still current.As visual presentational markup only applies directly to visual browsers, its use is discouraged. Stylesheets should be used instead. Several of these elements are deprecated or invalid in HTML 4 / XHTML 1.0, and the remainder are invalid in the current draft of XHTML 2.0. The current draft of HTML 5, however, includes <b>, <i>, and <small>.
<b>…</b>
{font-weight: bold}.) <strong>...</strong> usually has the same effect in visual browsers, as well as having more semantic meaning.B existed in HTML Internet Draft 1.2, and was standardised in HTML 2.0; still current.<i>…</i>
{font-style: italic}.) <em>...</em> usually has the same effect in visual browsers, as well as having more semantic meaning.I existed in HTML Internet Draft 1.2, and was standardised in HTML 2.0; still current.<big>…</big>
{font-size: larger}<small>…</small>
{font-size: smaller} and <strike>…</strike> (deprecated)<s>…</s> (deprecated)
{text-decoration: line-through})STRIKE was standardised in HTML 3.2; deprecated in HTML 4.0 Transitional; invalid in HTML 4.0 Strict.S is deprecated in HTML 4.0 Transitional (having not appeared in any previous standard), and is invalid in HTML 4.0 Strict.<tt>…</tt>
{font-family: monospace;})TT existed in HTML Internet Draft 1.2, and was standardised in HTML 2.0; still current.<u>…</u> (deprecated)
{text-decoration: underline})U existed in HTML Internet Draft 1.2, and was standardised in HTML 2.0; deprecated in HTML 4.0 Transitional; invalid in HTML 4.0 Strict.<font>…</font> (deprecated)
<font [color=color] [size=size] [face=face]>…</font>color attribute, typeface with the face attribute, and absolute or relative size with the size attribute.<font color="green">text</font> creates green text.<font color="#114499">text</font> creates text with hexadecimal color #114499.<font size="4">text</font> creates text with size 4. Sizes are from 1 to 7. The standard size is 3, unless otherwise specified in the <body> or other tags.<font size="+1">text</font> creates text with size 1 bigger than the standard. <font size="-1">text</font> is opposite.<font face="Courier">text</font> makes text with Courier font.<font size="N"> corresponds to {font-size: Yunits} (the HTML specification does not define the relationship between size N and unit-size Y, nor does it define a unit).<font color="red"> corresponds to {color: red}<font face="Courier"> corresponds to {font-family: "Courier"}<span>…</span>
<br>
<bdo>…</bdo>
<cite>…</cite>
<del>…</del>
<ins>…</ins>
<del>'d text. Typically rendered underlined: Inserted text.<ins> and <del> elements may also be used as block elements: containing other block and inline elements. However, these elements must still remain wholly within their parent element to maintain a well-formed HTML document. For example deleting text from the middle of one paragraph across several other paragraphs and ending in a final paragraph would need to use three separate <del> elements. Two <del> elements would be required as inline element to indicate the deletion of text in the first and last paragraphs, and a third, used as a block element, to indicate the deletion in the intervening paragraphs.<q>…</q>
BLOCKQUOTE). Quote elements may be nested.<q> should automatically generate quotation marks in conjunction with Stylesheets. Practical concerns due to browser non-compliance may force authors to find work-arounds.cite attribute gives the source, and must be a fully qualified URI.blockquote) using stylesheets. For example, with a suitable CSS rule associated with q.lengthy:<q class="lengthy">An inline quotation of significant length (say 25 words, for example) goes here...</q><script>...</script>
<script> is not itself either a block or inline element; by itself it should not display at all, but it can contain instructions to dynamically generate either both block or inline content.<sub>…</sub> and <sup>…</sup>
{vertical-align: sub} or {vertical-align: super}.)<applet>…</applet> (deprecated)
<object>, as it could only be used with Java applets, and had accessibility limitations.<object> are not consistent between different browsers.<area>
<img>
src attribute specifies the image URL. The required alt attribute provides alternative text in case the image cannot be displayed.[13] (Though alt is intended as alternative text, Microsoft Internet Explorer renders it as a tooltip if no title is given. Safari and Google Chrome, on the other hand, do not display the alt attribute at all.)[14] img was proposed by Marc Andreessen and implemented in the NSCA Mosaic web browser.[15]<map>…</map>
<object>…</object>
type attribute. This may be in any MIME-type the user agent understands, such as an embedded HTML page, a file to be handled by a plug-in such as Flash, a Java applet, a sound file, etc.<param>
These elements can be combined into a form or in some instances used separately as user-interface controls; in the document, they can be simple HTML or used in conjunction with Scripts. HTML markup specifies the elements that make up a form, and the method by which it will be submitted. However, some form of scripts (server-side, client-side, or both) must be used to process the user’s input once it is submitted.
(These elements are either block or inline elements, but are collected here as their use is more restricted than other inline or block elements.)
<form action="url">…</form>
<button>…</button>
<fieldset>…</fieldset>
<input>
<button> is preferred if possible (i.e. if the client supports it) as it provides richer possibilities.src attribute.size attribute specifies the default width of the input in character-widths. maxlength sets the maximum number of characters the user can enter (which may be greater than size).<isindex> (deprecated)
isindex could either appear in the document head or in the body, but only once in a document. Requires a server-side search engine.<label for="id">…</label>
<legend>…</legend>
<option value="x">
select list.<optgroup>…</optgroup>
<select name="xyz">…</select>
<textarea rows="8">…</textarea>
cols (where a col is a one-character width of text) and rows attributes. The content of this element is restricted to plain text, which appears in the text area as default text when the page is loaded.The format of HTML Tables was proposed in the HTML 3.0 Drafts and the later RFC 1942 HTML Tables. They were inspired on the CALS Table Model. Some elements in these proposals were included in HTML 3.2; the present form of HTML Tables was standardised in HTML 4. (Many of the elements used within tables are neither block nor inline elements.)
<table>…</table><tr>…</tr><th>…</th><td>…</td><colgroup>…</colgroup><col> or <col/><caption>…</caption><thead>…</thead><tbody>…</tbody><tfoot>…</tfoot><thead>, this section may be repeated by the user agent if the table is split across pages (in printing or other paged media).Frames allow a visual HTML Browser window to be split into segments, each of which can show a different document. This allows for lower bandwidth use, as repeating parts of a layout can be used in one frame, while variable content is displayed in another. This comes at a significant usability cost, especially in non-visual user agents. Because of this cost, frames (excluding the iframe element) are only allowed in HTML 4.01 Frameset.
In HTML 4.01, a document may contain a head and a body or a head and a frameset, but not both a body and a frameset. However, iframe can be used in a normal document body.
<frameset>…</frameset>rows and cols attributes.<frame> or <frame/>frameset. A separate document linked with the src attribute appears inside.<noframes>…</noframes><iframe>…</iframe>object element, an inline frame can be the "target" frame for links defined by other elements and it can be selected by the user agent as the focus for printing, viewing its source etc.The following elements were part of the early HTML developed by Tim Berners-Lee from 1989–91; they are mentioned in HTML Tags, but deprecated in HTML 2.0 and were never part of HTML standards.
<listing>…</listing><plaintext><xmp>…</xmp>pre.plaintext cannot have an end tag – it terminates the markup and causes the rest of the document to be parsed as if it was plain text.<nextid>…</nextid>nextid existed in HTML Tags (described as obsolete); deprecated in HTML 2.0; invalid in HTML 3.2 and later.This section lists some widely-used obsolete elements, which means they are not allowed to be used. They may not be supported in all user agents.
<blink>…</blink>{text-decoration: blink} (This effect may have negative consequences for people with epilepsy[16].; its use on the public internet should follow the appropriate guidelines.)<marquee>…</marquee><nobr>…</nobr>{white-space: nowrap;}<wbr> or <wbr/><noembed>…</noembed><!-- A Comment -->style and script elements are still sometimes surrounded by comment delimiters.<!--Xbegin<!--Y-->Xend--> will yield the comment Xbegin<!--Y and the text Xend--> after it.Some text and images from HTML elements at Wikipedia under the GFDL licence. 124575 bytes, 2010-09-01
a Bit About combines Wikipedia articles with Google searchYou can go straight to an article (Science, History, Art, Rock Music, ) e.g.
...or search and get results from Wikipedia and the web combined. Our UK Store.Follow aBitAbout on Facebook | Twitter |
2010 : September 3
|