homepage Welcome to WebmasterWorld Guest from
register, free tools, login, search, pro membership, help, library, announcements, recent posts, open posts,
Pubcon Platinum Sponsor 2014
Home / Forums Index / Code, Content, and Presentation / XML Development
Forum Library, Charter, Moderators: httpwebwitch

XML Development Forum

XML Feed import issue/question

WebmasterWorld Senior Member 10+ Year Member

Msg#: 3816211 posted 1:29 pm on Dec 30, 2008 (gmt 0)

I have a price comparison site, we import and parse XML feeds from different merchants.

The issue is that if we import an ipod nano from your <category> which is gadgets , and our site contains the category mp3 players.

Any know of a good solution to allow us to match categories so that we can match merchants category to our category?



WebmasterWorld Administrator httpwebwitch us a WebmasterWorld Top Contributor of All Time 10+ Year Member

Msg#: 3816211 posted 2:05 pm on Dec 30, 2008 (gmt 0)

It sounds like your categories are more granular than the ones used by the merchant, which means you can't simply pour all the "gadgets" into the "mp3 player" bin. But there may be some categories that map directly to yours, so find those and create a rule:

if (category is 'portable music devices'){
mycategory = 'mp3 players'

or perhaps you may need tables full of these rules

if (category in array('portable music devices','mp3','music players','ipods',...)){
mycategory = 'mp3 players'

If there is no obvious category mapping, then your job gets tougher. You've got an item, presumably with a title - like "ipod nano" - and you need to categorize it yourself.

I'd be inclined to write a script that checks an item's similarity to other items that already exist, based on the item's name and words in the description, then figure out the most likely category by looking at its nearest neighbours.

the technique is called "data clustering" and it's not for the faint of heart

Another alternative is to do it manually, which may be easier (depends how many items we're talking about)

Global Options:
 top home search open messages active posts  

Home / Forums Index / Code, Content, and Presentation / XML Development
rss feed

All trademarks and copyrights held by respective owners. Member comments are owned by the poster.
Home ¦ Free Tools ¦ Terms of Service ¦ Privacy Policy ¦ Report Problem ¦ About ¦ Library ¦ Newsletter
WebmasterWorld is a Developer Shed Community owned by Jim Boykin.
© Webmaster World 1996-2014 all rights reserved