homepage Welcome to WebmasterWorld Guest from
register, free tools, login, search, pro membership, help, library, announcements, recent posts, open posts,
Become a Pro Member
Home / Forums Index / WebmasterWorld / Accessibility and Usability
Forum Library, Charter, Moderators: ergophobe

Accessibility and Usability Forum

Semantic Data Extractor
HTML semantic rich documents.

 2:48 pm on Nov 25, 2006 (gmt 0)

The aim is to show that providing a semantically rich HTML gives much more value to your code: using a semantically rich HTML code allows a better use of CSS, makes your HTML intelligible to a wider range of user agents (especially search engines bots).

Have you been using this tool to your advantage? ;)

Semantic Data Extractor



 5:54 pm on Nov 25, 2006 (gmt 0)

I don't see how anybody could be using this tool to their advantage, because it is broken. That is, it just plain does not work, returning a error message.

I assume it worked until some recent update...

(I am referring to the demo on the website.)


 7:13 pm on Nov 25, 2006 (gmt 0)

Works just fine for me, jtara. What error message are you getting? Bear in mind that the tool expects a full URI to a resource, not just the domain name, ie. http://www.example.com/ rather than www.example.com.

The tool itself is a simple and interesting little utility to see if you can extract the correct meaning from the way the page uses markup. It can be useful in pointing out potentially confusing associations. A good example which I tried showed that the contents of a sidebar (using h4 elements for each header) were seen as being appended on the final node created by the article links in the main content area (which were marked up with h3 elements). The fix would be to have a h2 or h3 element introducing the sidebar sub-headings.


 8:21 pm on Nov 25, 2006 (gmt 0)

I get this error message:

Using org.apache.xerces.parsers.SAXParser
Exception net.sf.saxon.trans.DynamicError: org.xml.sax.SAXParseException: Content is not allowed in prolog.
org.xml.sax.SAXParseException: Content is not allowed in prolog.


 8:36 pm on Nov 25, 2006 (gmt 0)

It shows the meta data and the organizational outline for the page. This is useful, and it's also available as an option when you validate your HTML.

Global Options:
 top home search open messages active posts  

Home / Forums Index / WebmasterWorld / Accessibility and Usability
rss feed

All trademarks and copyrights held by respective owners. Member comments are owned by the poster.
Home ¦ Free Tools ¦ Terms of Service ¦ Privacy Policy ¦ Report Problem ¦ About ¦ Library ¦ Newsletter
WebmasterWorld is a Developer Shed Community owned by Jim Boykin.
© Webmaster World 1996-2014 all rights reserved