homepage Welcome to WebmasterWorld Guest from 54.204.73.126
register, free tools, login, search, pro membership, help, library, announcements, recent posts, open posts,
Become a Pro Member
Home / Forums Index / Code, Content, and Presentation / XML Development
Forum Library, Charter, Moderators: httpwebwitch

XML Development Forum

    
What is XML ?
malcolmcroucher

5+ Year Member



 
Msg#: 3698985 posted 4:09 pm on Jul 15, 2008 (gmt 0)

Hey ,

I just created a sitemap in XML but i dont know what xml is or does ?

How do normal pages link to an xml page ?

or do they ?

Regards

Malcolm

 

phranque

WebmasterWorld Administrator phranque us a WebmasterWorld Top Contributor of All Time 10+ Year Member Top Contributors Of The Month



 
Msg#: 3698985 posted 6:51 am on Jul 16, 2008 (gmt 0)

to partially answer your first question:

xml is...
a ten year old specification and still going strong at version 1.0!
[webmasterworld.com...]

basically it is a spec for a markup language to define markup languages.
you can define and write html in xml, for example.

one other common usage is for defining a data set or structure for exchange purposes.

eelixduppy

WebmasterWorld Senior Member eelixduppy us a WebmasterWorld Top Contributor of All Time 5+ Year Member



 
Msg#: 3698985 posted 6:53 am on Jul 16, 2008 (gmt 0)

There are a lot of XML tutorials online, as well. A simple Google search will tell. You can also start at w3schools [w3schools.com] and go from there :)

httpwebwitch

WebmasterWorld Administrator httpwebwitch us a WebmasterWorld Top Contributor of All Time 10+ Year Member



 
Msg#: 3698985 posted 5:14 pm on Jul 16, 2008 (gmt 0)

It'll help me to explain things if I define some words for our purpose:
A "page" is something you'd load in a browser, like a layout with purty graphics and CSS, links, text, a logo, and some things you can click on. Pages are usually created using HTML and other ingredients. Web sites are usually made of pages. Your MySpace profile is a web page. Pages are usually (almost always) built using HTML, but the Page as a whole also includes all the images and scripts and stylesheets that make it look the way it does.

An XML file, or a URL which serves XML, is usually called a "Document". Don't worry why. But when I refer to an XML Document, I won't mean the same things as a "page".

One ingredient of a Page is the HTML Document. The Document is the DOCTYPE declaration (optional), the <html> tag (required), and everything inside that. In addition to the Document, all the extra stuff like images and stylesheets and scripts comprise a Page.

Both HTML and XML are constructed as Documents. Indeed there's a flavour of HTML, called XHTML, which *is* XML. As mentioned already, XML is a language which lets you create other languages... that's an awkward concept to grasp, so consider this analogy: LEGO is a toy which lets you create other toys. If XHTML is your assembled Millenium Falcon sculpture, XML is the blocks that let you create it. XML has its own rules for validity and syntax, just like LEGO has its own rules for how the blocks snap together. With LEGO you can build the Millenium Falcon, Hogwarts Castle, and a motorized truck. With XML, you can build HTML, RSS, MathML, XSLT, SVG, and dozens of other well-defined languages. All those languages have their own words and grammar, but they all obey the fundamental rules which we define as XML.

Unlike HTML, XML is primarily intended to be consumed by machines. The elements in an XML Document are named and nested in such a way that they are human-readable, but can also be read ("parsed") by a computer program to get the data defined within it.

An XML sitemap is not (usually) intended to be viewed as a Page. It's intended to be read by search engine robots, so they can easily find the pages in your site, request them, index them, and get your site ranking in their Search Engine. Put your XML sitemap in a place where they'll find it - more information about how to do that is available here [sitemaps.org].

When you view an XML Document in a web browser, it looks awful. Most browsers render it as a tree with nodes you can expand or collapse. The browser makes it readable, and useful, but it doesn't exude much character.
Here is an XML file that gets used a lot in online tutorials:
[w3schools.com...]

When you view your sitemap.xml in a browser, it should look like that.

There is a way to make an XML Document look more like a Page - and that is by defining a stylesheet for it written in XSLT*, and prettified with CSS. Here is an example of an XML Document [webmasterworld.com] that is being rendered in your browser like it's a Page. In fact, it's fair to call that an XML Page. If you View the Source of that page, you'll see that it's not HTML, it's XML.

* ( XSLT has notoriously steep learning curve, but those who pursue mastery of XSLT tend to find it's an indispensible part of a Webmaster's skillset. It's prerequisite to learn the basics of XML before trying XSLT. )

On many sites I've built, there are two places where you can find a site map:
http://www.example.com/sitemap.xml <- this is the one put there for Google and its robotic friends
http://www.example.com/sitemap.html <- this is the one for you. There's a link to it in the layout footer.

So, to answer your question (finally), yes you can link to an XML Document, and you can even link to an XML Page. Entire sites can be built in XML instead of HTML. However, "normal" pages don't usually do that.

Links to XML Documents are quite common - whenever you see an RSS feed button, that's a link to an XML Document. (RSS is one of those languages built with XML)

XML is a very rich subject, and it's used for lots of other things besides making sitemaps. go and explore!

Global Options:
 top home search open messages active posts  
 

Home / Forums Index / Code, Content, and Presentation / XML Development
rss feed

All trademarks and copyrights held by respective owners. Member comments are owned by the poster.
Home ¦ Free Tools ¦ Terms of Service ¦ Privacy Policy ¦ Report Problem ¦ About ¦ Library ¦ Newsletter
WebmasterWorld is a Developer Shed Community owned by Jim Boykin.
© Webmaster World 1996-2014 all rights reserved