homepage Welcome to WebmasterWorld Guest from 23.22.128.96
register, free tools, login, search, pro membership, help, library, announcements, recent posts, open posts,
Pubcon Platinum Sponsor 2014
Home / Forums Index / Microsoft / Bing Search Engine News
Forum Library, Charter, Moderators: mack

Bing Search Engine News Forum

    
Sitemap For MSN
I have a nice Google XML sitemap, want to convert for MSN
mboydnv




msg:1539624
 8:56 pm on Jun 24, 2006 (gmt 0)

I have about 1,000 pages to make a sitemap for MSN (and Yahoo too).

I have a nice Google XML sitemap built (Thanks to wordpress plugin).

Basically, I can fetch the links from it via url extractor and clean it with a text editor, but I can't seem to find any definitive info on creating a sitemap for yahoo and msn. Could someone point me to a Hands on Tutorial (with code examples) for the format.

Do i use bullets? Can I just put links on the page (how many etc) do I have to put Anchor text for those links, make catagories? Etc etc.

I basically just want to toss up just the links.

like this:

<a href="http://www.mydomain.com/mypage2.html"></a>
<br>
<a href="http://www.mydomain.com/mypage.html"></a>

Can I do that?

Cheers

 

tigersoft




msg:1539625
 11:16 pm on Jun 24, 2006 (gmt 0)

Hi,

For yahoo sitemap you can use "Site Map Builder .Net" program.

But i am also looking for info to create sitemap for Msn. And where to submit sitemap for msn.

mboydnv




msg:1539626
 1:00 am on Jun 25, 2006 (gmt 0)

Ummm, thanks for the plug to some windows software, but I am on a Macintosh, and I don't mind doing this by hand.

I need someone here to post some sample code from one of their successful sitemaps, so i can finish the doc here on my end.

Anyone willing to share?

Thanks

katheesue




msg:1539627
 5:46 pm on Jun 25, 2006 (gmt 0)

It should be fairly simple to use the sitemap.xml file and do a search and replace for multiple things.

It is trivial if you have a spreadsheet.

<url>
<loc>http://www.<example>.com/</loc>
<lastmod>2006-06-23T17:46:32+00:00</lastmod>
<changefreq>daily</changefreq>
<priority>1</priority>
</url>

Remove all lines which contain <lastmod>
Remove all lines which contain <changefreq>
Remove all lines which contain <priority>
Search for <url> replace with <li>
Search for </url> replace with </li>
Search for <loc> replace with <a href=">
Search for </loc> replace with ">sometext</a>

In a spreadsheet, you can write a formula which copies the url again into the location which says "sometext" and be done. Otherwise, you need to find a way to put a descriptive, useful title into that location.

Once you have done that, you can simply add <ul> at the top and </ul> at the bottom of the list of urls and remove everthing else, format the page the way you want it to look, split it into ten or twelve pages with a navigation system (try to keep less than 100 links per page) and you should be done.

katheesue




msg:1539628
 5:48 pm on Jun 25, 2006 (gmt 0)

PS: since OS X is UNIX-like, you might be able to use sed and grep and ¦ and > and things like that to do this in one line of code.

mboydnv




msg:1539629
 7:22 pm on Jun 25, 2006 (gmt 0)

Thank you KathheeSue!

Was just gonna ask you if you knew the grep command, been a while since I've done Grep...

<lastmod>2006-06-23T17:46:32+00:00</lastmod>

obviously the time changes on each one

<lastmod>Grep Comman Here * someting...etc </lastmod>

Than kyou for taking the time to care and share...!

katheesue




msg:1539630
 12:03 am on Jun 26, 2006 (gmt 0)

If you grep for the line containing <lastmod> and exclude those lines from the stream, you can then pipe that to the next command.

Be sure to make a backup copy before you do any of this.

...

ummmmmm

....

grep -v <pattern> <filename> >> newfile.txt

?

so

grep -v lastmod originalfile.txt > newfile.txt

Once everything is gone that you don't want, maybe sed would work better for the substitution part?

mboydnv




msg:1539631
 1:25 am on Jun 26, 2006 (gmt 0)

It's Greek to me, thanks though....

quotations




msg:1539632
 1:47 am on Jun 26, 2006 (gmt 0)

put the sitemap in a directory by itself.

run this command:
grep -v lastmod sitemap.xml > newfile.txt

run this command
grep -v changefreq newfile.txt > newfile2.txt

run this command:
grep -v priority newfile2.txt > newfile3.txt

Now you should have a file called newfile3.txt which does not contain the lines which you don't want.

Pull it into a text editor and do search and replace for the other items mentioned above.

Global Options:
 top home search open messages active posts  
 

Home / Forums Index / Microsoft / Bing Search Engine News
rss feed

All trademarks and copyrights held by respective owners. Member comments are owned by the poster.
Home ¦ Free Tools ¦ Terms of Service ¦ Privacy Policy ¦ Report Problem ¦ About ¦ Library ¦ Newsletter
WebmasterWorld is a Developer Shed Community owned by Jim Boykin.
© Webmaster World 1996-2014 all rights reserved