Tuesday, February 5, 2008

HTML 5 aims to formalize Semantic HTML

It’s kind of hard to believe that it’s almost been a decade since HTML 4.01 was made official. However, it has taken browsers a while to recognize the standard, and this process continues today. I was pleased to see that the W3C has released the first draft of HTML 5 with its goal of creating Semantic HTML.

Semantic HTML

The impetus behind Semantic HTML is the creation of HTML documents that are void of any reference to how the HTML is presented. This means that Semantic HTML contains only the implied meaning of data via the use of the appropriate tags. This has been the goal of the separation of content and presentation, and HTML 5 is pushing it closer to reality.

A couple examples of Semantic HTML include only using the table element for tabular data, and actual header tags (,,) are used to mark important data on a page. The creation of Semantic HTML introduces a number of new features and banishes some older elements.

New additions in HTML 5
HTML 5’s new additions focus on Semantic HTML by allowing Web developers to easily assign meaning to content via HTML tags. The following list provides a sampling of these elements:

  • article: Represents an independent piece of content within a page.
  • audio: Provides multimedia support.
  • canvas: Renders dynamic bitmap graphics on the fly, such as graphs, games, and so forth.
  • dialog: Marks up a conversation.
  • embed: Provides support for plug-in content.
  • footer: Represents the footer of a section.
  • header: Represents the header of a section.
  • nav: Represents a section of a document whose purpose is navigation.
  • section: Represents a generic section of a document.
  • video: Provides multimedia support.

This subset of new elements demonstrates that presentational HTML is gone. Turn your attention to CSS when approaching the task of properly delivering content to the user community. One worry with working with a new technology is backward compatibility.

The HTML 5 standard includes two conformance requirements for Web developers and the software used to view the content (the user agent). The user agent section includes guidelines to ensure older HTML is correctly rendered, so there should be no worries about breaking existing Web content.

Out with the old
A quick perusal of the first draft of HTML 5 shows that the push towards CSS for presentation continues, as relevant elements have been removed from HTML 5. These elements include font manipulation with the standard font tag, along with basefont and big.

You will no longer be able to use the underline element for underlining text, and strike is gone as well. It is interesting to see the bold and italics elements are still in place. One element that is often overused is; beginning with HTML 5, you should use CSS to horizontally center elements.

Numerous attributes have also been removed; for instance, the align, background, and bgcolor attributes have been removed from the body element.

I know the removal of frame-related elements (frame, frameset, and noframes) will receive a resounding cheer from Web developers. The negative impact on accessibility and usability were the final straw for frames. Also, the applet tag has been removed, and the object tag is now the standard.

This is just a quick list of changes; the W3C site offers a detailed list of what has changed from HTML 4 to HTML 5.

cheers Aurobindo!!!





AddThis Feed Button