Forum Moderators: open

Message Too Old, No Replies

The perfect website traffic analyzer

dream up your own

         

wardbekker

3:20 pm on Jan 19, 2002 (gmt 0)

10+ Year Member



The traffic analyzer programs I know (livestats, webtrends etc) don't have all the functionality I need, for instance:

- When a spider visits my websites, which pages are visited per 'session'
- per page referral overview
- per page overview of keywords used in the SE
- per page overview of all known referrals
- sql query interface with some general pre-made scripts & custom scripts

What features would you, hard working SEO's, want to make your work easier? [;-)]

amoore

4:19 pm on Jan 19, 2002 (gmt 0)

10+ Year Member



- sql query interface

Have you tried using the Apache mod_perl module DBI::Logger? It writes your logs to an SQL database (I use MySQL) so that you can pull arbitrary stuff from it using SQL.

I have been using it (or a similar perl module) for about a year with great results. I can pull all kinds of information out of it that you can't usually get without some really creative uses of grep, awk, sed, cut and all those other text processing commands.

I know that it's not a direct answer to your question, but if that's one thing you're looking for...

wardbekker

4:27 pm on Jan 19, 2002 (gmt 0)

10+ Year Member



Not what i was looking for ;-)

I want to invite you to sum up your 'dream' features of a good traffic analyzer for SEO's = A brainstorm ;-)

skiguide

11:36 pm on Jan 21, 2002 (gmt 0)

10+ Year Member



so if i tell you everything i want, can you build it??

I've been on the lookout for a high-level tracking tool that would allow us as SEO's to (i guess download raw log files) from client sites that are hosted on various servers (not our own, as we are not a development firm - & they usually hire us after the site is built/hosted) but - i have no idea what resources it would take for us to do that with as many as 25-50 clients at any given time.

I want something that will track search engine to search phrase (to a specific page) to sale, show visitor paths through the website and include bail-out points, accurately separate 'visitor sessions' from spiders, allow me to choose various time/date ranges, and generate easily readable reports to show clients via an extranet or something.

i'm sure there's more requirements I had, but it's late in the day, and that'll give you something to chew on for now.

ggrot

12:28 am on Jan 22, 2002 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



Well, in the netherworld of 'perfection', the ultimate traffic analyzer would take it to the next logical step. Instead of having the webmaster look at useful data and make relevant changes/reports, the logger would test and adjust pages automagically for you leading to the optimal SEO and sales copy combination for greatest traffic and coversion. It would optimize a curve with these two variables for a third variable = net profit. It would basically 'evolve' your site with profit being the darwinian test of fitness. Of course, it would send you a progress report at various intervals to let you know what changes have been made and what the improvements were.

Unfortunately, if there were an individual with such programming skills as to create such an intelligent system, many of us would be out of a job.

Tapolyai

1:23 am on Jan 22, 2002 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



Let me fetch it even further. The traffic analyzer would analyze all traffic of all web sites, and all other electronic media at the individual level 24/7/365.

The system then would automagically adjust all our servicesto the indivials best and most desired needs, making whatever we sell irresistable. People would pay $20 for a single paper clip...

Tapolyai

1:26 am on Jan 22, 2002 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



On second thought... I am removing all my log analyzers, and turning off logging!

wardbekker

8:43 am on Jan 22, 2002 (gmt 0)

10+ Year Member



" so if i tell you everything i want, can you build it?? "

I CAN build it (i'm a senior web application programmer). But when brainstorming i've learned not to think of the actual technical implementation, it slows down the creativity ;-) I'm more interested in great ideas in this stage.

ggrot :

Great idea! I don't think you would be out of your job. It's a great tool for a great human mind. The first initial keywords should be chosen by the SEO. Keywords vistors used @ google the are not in the initial 'lists' are saved by the system & tried out. That can cause 'dirty' keywords (completely off-theme). The SEO has to monitor new keywords. If your website doesn't show up @ certain keywords that you don't have in your listing, the system can never find those new ones out (do you got an idea to accomplish this?).But it would be still a very handy tool.

Great idea's people! Could still use a lot more input. Dream on! ;-)

Robert Charlton

7:20 am on Jan 23, 2002 (gmt 0)

WebmasterWorld Administrator 10+ Year Member Top Contributors Of The Month



I'd be happy to see skiguide's suggestions implemented.

ralnikov

10:45 am on Jan 23, 2002 (gmt 0)

10+ Year Member



It's not very hard to implement all those suggestions.
But a couple of questions:
- What platform do you use?
- How much $$$ you could pay for such software? :)
- How many people are interested?

wardbekker

11:00 am on Jan 23, 2002 (gmt 0)

10+ Year Member



windows, depends, dunno

Why do you ask?

paynt

2:47 pm on Jan 23, 2002 (gmt 0)



The one feature I like particularly is when I can drop any raw logs in and analyze them without having to register every site I work on with the company I purchased the software from.

ROLAND_F

4:26 pm on Jan 23, 2002 (gmt 0)

10+ Year Member



The perfect tool ?
It would plug in Apache, and send all info about HTTP request to a central server to gather stats in real time instead of these dumb logfiles that are allways outdated.
Wouldn't it be nice to have nice realtime graphics like traders have ?
Something like 3 or 4 flat panel displays that show how well your websites do. (Graphic of referrers , Keywords used over time to reach your website, search engine, ...).
It would be able to analyse current traffic and do automatic optimization of content in a kind of feedback loop.
(You could build a list of words with almost the same meaning that can be put in place of each other so automatic substitution could be achieved.)

vitaplease

4:37 pm on Jan 23, 2002 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



While you guys are busy programming..

please include:

automatically next to every keyphrase logged:

1. Overture and Wordtracker # search stats per month of that keyphrase

2. SERP position within the search engine that gave you that referral for that specific search phrase.

and I aggree with Roland - make it live!

ggrot

5:36 pm on Jan 23, 2002 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



>> make it live!

The problem with live statistics is load scaling. Imagine you are running amazon.com and every request is sent to a secondary server database to log the traffic. This is fine and dandy if your server(s) can handle the requests, which is reasonable to assumme.

But then assumme that you want to run a report that requires a summary of all the file requests in the database. You want a snapshot at an instant of time nearby the time you submitted the request. Thus, once you start processing the report request you have to lock the database for the duration of that request. Since your report may easily have to access the full range of values a few times(which would be billions of requests), it is logical to assumme this could take a few seconds during which no other processes would be able to access the database. If you interleave this process with updates, you will have a corrupt report.

This is not necessarily a problem by itself, if you have a good queueing system as most modern databases do, the logging requests can be handled after report generation has completed. The problem lies in what other things this database is used for. In amazon's case, it displays products on the site that you have browsed for before and related products. To do this, the scripts would be requesting data from the database and due to the backlog might be waiting 30-40 seconds before it can generate the page. You just lost 100 customers there in that minute.

There are ways around all of this, but it is generally costly in hardware terms.

ROLAND_F

5:42 pm on Jan 23, 2002 (gmt 0)

10+ Year Member



Couldn't we compute the stats in realtime ?

For each new referrer add a new entry in a table in memory.
Each time you get a new visitor from this referrer, increment the counter.
At the end of some timescale (minute, hour, day, week, month, year)
save the result so you don't have to compute it a second time and scroll the graphics...

ggrot

8:04 pm on Jan 23, 2002 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



Thats oversimplified stats. You don't know what time of day, day of week, etc the visitors arrived at. You don't know how well different referrer entry's converted. Lots of other possibilities. I'm not saying you can't do live stats. In fact, the sites I run have live stats, but we are talking only a few hundred sessions per day per site at most. I don't think that the system I'm using now (homebrewed) would scale very well for a high visibility site.

ROLAND_F

10:03 am on Jan 24, 2002 (gmt 0)

10+ Year Member



Ok so maybe it's possible to have non-realtime stats with great detail and realtime stats with less details.

wardbekker

10:10 am on Jan 24, 2002 (gmt 0)

10+ Year Member



Realtime stats are a cool thing, but you have to consider the real value of such a feature. I only need the most accurate stats when i'm taking time for research, and if that's data from every day till yesterday, i think i can manage ;-)

Real time warnings about errors, processor over load, spiders that are indexing etc etc are a different matter. I rather have an intelligent 'PUSH' system ;-) That minimizes my time spend

TallTroll

1:02 pm on Jan 24, 2002 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



On the live stats, even if you have a massive d/base, and report compiling takes a couple of hours, you may be able to get round that using a dupe d/base.

The live database uses a synch routine to update a secondary database. When a report is requested, the 2nd d/base locks out, and runs the report. When the report is done, the synch routine will be able to tell where the last update was, and synch the d/bases up again (maybe use a timestamp to control the synching?)