Forum Moderators: open

Message Too Old, No Replies

Anyone know of a word frequency counter?

         

gtodell

10:20 pm on Feb 24, 2006 (gmt 0)

10+ Year Member



Does anyone know of a program that can count the frequency of a particular word contained within the html (within metatags, etc) of an entire web site (online or offline)?

I have seen several programs that can do this one page at a time but I need something that can perform this task for an entire site.

Many thanks in advance.

Gordon

tedster

5:29 pm on Feb 25, 2006 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



Homesite does this -- every document in a folder, including or excluding sub-folders as you choose. I mention it because some people may have it handy. But it would be an outlandish expense to purchase it for just this function.

In general, I'd suggest looking for programs that can do "extended search and replace" or "search multple files" -- as part of the core function, they often report on how many instances of the search term have been found.

gtodell

10:04 pm on Feb 25, 2006 (gmt 0)

10+ Year Member



Are you referring to Macromedia's Homesite? I thought the feature I need would only work for local web sites, not live sites that are not mine. Let me know if I am mistaken. FrontPage, which I already own, will do the counting I need but, again, only locally, or lives it is my site.

tedster

11:35 pm on Feb 25, 2006 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



You are correct -- offline only. I didn't understand that you wanted both functionalities, I was hypnotized by the "or".

An online check could be a tough program to write -- unless you can assume that all pages and folder have a link trail from the home page. But assuming that is the case, then why not rip all the html files down to a local copy (essentially that's what must happen anyway) and then run Homesite or FP over the downloaded files.

DrDoc

6:42 am on Feb 26, 2006 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



Why do that, even?
Use Homesite or FP (or whatever you have) to count the word frequency locally. Then use Google to check how many times that word appears on live sites. For example, if you want to know how many times the fabulous word DrDoc appears here on WebmasterWorld:
site:webmasterworld.com DrDoc
;)

Or, if you have access to all the sites, just run a nifty shell command ... like

grep
. It will tell you the number.

henry0

2:34 pm on Feb 26, 2006 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



G search is a good tip.
However, correct me if I am wrong, wouldn't that query only ranked pages?

tedster

6:24 pm on Feb 26, 2006 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



Yes, the Google idea is theoretically good, but it would only work for fully indexed pages. And I'm not also convinced of the accuracy of that number -- Google does say "about" 58,500 results. In fact, I've seen the number change as you click through to a deeper page of the results, possibly because of a change in data center.

gtodell

12:20 am on Feb 28, 2006 (gmt 0)

10+ Year Member



DrDoc,

Does grep (or other method) search WITHIN the html of the pages? This is critical for my purposes.

Thanks.

gtodell

2:08 am on Feb 28, 2006 (gmt 0)

10+ Year Member



DrDoc,

Got it! Windows Grep 2.3 does what I need - at least locally.

Thanks for the tip.