Forum Moderators: phranque
How would you grab remote files and search those? You build a spider that stores the pages where? What would you keep and what would you throw away? Do you keep it in a database? How do you keep the keyword density, or is that measured at search time? I have always wondered about site search and search engines programatically, but have just never asked.
Not really looking for code just the theory behind it?
[edited by: korkus2000 at 1:02 pm (utc) on Oct. 3, 2003]
I got myself all over the place with this. But there are some decent links/comments in there.