homepage Welcome to WebmasterWorld Guest from
register, free tools, login, search, pro membership, help, library, announcements, recent posts, open posts,
Become a Pro Member
Home / Forums Index / Code, Content, and Presentation / Perl Server Side CGI Scripting
Forum Library, Charter, Moderators: coopster & jatar k & phranque

Perl Server Side CGI Scripting Forum

Settle pointless pub disputes with mechanize
Mongering for the masses ... how to mashup for blokely advantage

 3:57 pm on May 31, 2007 (gmt 0)

Alright Perlsters - How would one go about the following?

a) Get from Wikipedia a list of persons categorised in a certain way eg. Movie stars born in Iowa;
b) compare that list with an integer taken from their entry in imdb eg. Number of films they have appeared in;
c) put them in a table listed by order of that integer eg. movie stars born in Iowa sorted by number of films they appeared in.
d) Update the data daily.

Is this a task for mechanize? Someone must have done this already :D

-- Case



 9:36 pm on May 31, 2007 (gmt 0)

Sounds like something WWW::Mechanize can handle.


 1:13 am on Jun 1, 2007 (gmt 0)

It is also possible to make an API for IMDb, here:

But I wasn't very impressed with it. I searched the IMDb site for info but nothing on it so I don't know how he's done that to tweak it for an actor search.


 10:23 am on Jun 2, 2007 (gmt 0)

Thanks guys. Wikipedia also has an api - well, more of a fake-browser really - but they haven't yet implemented the ability to draw down lists of articles in various categories :( It's interesting that IMDB has a TOS prohibition on outright spidering, but they give away their entire dataset. It's also quite odd that they don't yet have an api.

Global Options:
 top home search open messages active posts  

Home / Forums Index / Code, Content, and Presentation / Perl Server Side CGI Scripting
rss feed

All trademarks and copyrights held by respective owners. Member comments are owned by the poster.
Home ¦ Free Tools ¦ Terms of Service ¦ Privacy Policy ¦ Report Problem ¦ About ¦ Library ¦ Newsletter
WebmasterWorld is a Developer Shed Community owned by Jim Boykin.
© Webmaster World 1996-2014 all rights reserved