Most of the links look like this:
<a class=mainlisttitle href="/index.php?action=read&sez=&id=309">text</a>
I'll post the complete URL; if someone is willing to check it, I would really appreciate it:
www ilbisturi it
I miswrote it on purpose, so I won't look like a spammer :)
TIA,
Manuele
Hope this helps.
Recently Google has been more likely to spider URLs with several parameters. Still, try to keep the number low (2 or 3 will do for Google).
Thanks,
swerve
But the most important thing is to avoid a parameter with a name like 'id'. This indicates a session-id, and following those links would result in spidering many (almost) identical pages.
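A quick way to audit links against the two rules of thumb above (few parameters, no parameter that looks like a session ID) is sketched below. The list of suspicious names and the limit of 3 are assumptions taken from the advice in this thread, not anything Google documents.

```python
from urllib.parse import urlsplit, parse_qsl

# Assumption: example names that might be mistaken for a session ID.
SUSPECT_NAMES = {"id", "sid", "sessionid", "phpsessid"}

def inspect_url(url, max_params=3):
    """Return (parameter count, suspicious names, count within limit?)."""
    query = urlsplit(url).query
    # keep_blank_values=True so that an empty parameter like 'sez=' is counted
    params = parse_qsl(query, keep_blank_values=True)
    names = [name for name, _ in params]
    suspicious = [n for n in names if n.lower() in SUSPECT_NAMES]
    return len(names), suspicious, len(names) <= max_params

# The link quoted at the top of the thread
print(inspect_url("/index.php?action=read&sez=&id=309"))  # (3, ['id'], True)
```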
Hmm... this is interesting to know. I'm having trouble getting Google to spider my sub-pages, even without a query string. The URLs look something like this:
[......]
I wonder - could Google be smart enough to realize I'm using the path as a replacement for the parameters and notice the ciid?
He means the status code that your HTTP server responds with when something makes a request for a page. 200 means OK, 404 means not found, 500 means internal server error, etc.
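For anyone who wants to check this without an online tool, here is a minimal Python sketch. `describe` only looks up the standard reason phrase for a code; `fetch_status` is a hypothetical helper that sends a HEAD request and is shown for illustration (it needs network access, and some servers answer HEAD differently from GET).

```python
from http import HTTPStatus
import urllib.error
import urllib.request

def describe(code):
    """Return the code with its standard reason phrase, e.g. '200 OK'."""
    return f"{code} {HTTPStatus(code).phrase}"

def fetch_status(url):
    """Request url and return the numeric status code of the final response.

    Note: urlopen follows redirects, so a 301/302 is not reported here.
    """
    request = urllib.request.Request(url, method="HEAD")
    try:
        with urllib.request.urlopen(request, timeout=10) as response:
            return response.status
    except urllib.error.HTTPError as err:
        return err.code  # 4xx and 5xx responses are raised as exceptions

for code in (200, 404, 500):
    print(describe(code))
# 200 OK
# 404 Not Found
# 500 Internal Server Error
```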
My site sends back a 200 response code for both the page I'm trying to get it to spider and the page that links to it.
But the other engines spider it OK, so I'm thinking maybe it's just another part of the wackiness of this latest update...
still no spidering of the subpages...
can someone give me another good hint?
www ilbisturi it
is the page.
Thanks again
Manuele
<a href="/index.php?action=read&sez=100&id=274">
but the current page looks better:
<a href="/read/274/">
A check at Server Header Check [webmasterworld.com] gave a 'HTTP/1.1 200 OK', so that looks good. Maybe just wait a bit longer for Google to get the sub pages as well. By the way, you have hardly any links to your site (Google, AllTheWeb, AltaVista, Inktomi all say: 0 links). Having some more could also help to get sub pages spidered.
I did this to make all the pages look as if they were in the same subdirectory (better for many things...)
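The mapping from the old query-string links to the new path-style links can be sketched like this. It only illustrates the URL pattern seen in this thread; the function name is made up, and the real site presumably does the reverse translation on the server side, which is not shown here.

```python
from urllib.parse import urlsplit, parse_qs

def to_path_url(url):
    """Map '/index.php?action=read&id=274' to '/read/274/'.

    Other parameters (such as 'sez') are dropped. The URL is returned
    unchanged if 'action' or 'id' is missing.
    """
    params = parse_qs(urlsplit(url).query)
    action = params.get("action", [None])[0]
    item_id = params.get("id", [None])[0]
    if action is None or item_id is None:
        return url
    return f"/{action}/{item_id}/"

print(to_path_url("/index.php?action=read&sez=100&id=274"))  # /read/274/
```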
Now I'm waiting for the next spidering to see what happens...
(It is OK if the other links, downloads and so on, are not spidered, so the id parameter can stay there. I want to solve the main problem first and see what happens later...)
Thank you again... I will let you know :)
Thanks for explaining. I thought that was what he meant but the way he said it lost me.
Anyone have a good idea why Google crawled me back in April but the pages have no title or cache in their directory? I keep asking hoping someone has the magic solution. Think Google will get the content next time around?
Thanks
new spidering, still only "/"...
I'm getting very upset :///
Any help appreciated...
2) Second question is a bit more complicated:
I have archive pages for news (archive_SECTION_OFFSET.html)
where SECTION and OFFSET are numbers...
The point is that with an OFFSET equal to 0 you get the newest archive page: this means that archive_3_0.html is always changing ... and that is the page I reference from the HP.
Now, as far as I can see, Google is not following the link at the bottom of the archive page (it is an image-only anchor: left and right arrows), so it is losing all the older archives (i.e. archive_3_1.html, archive_3_2.html and so on) and, because of that, it also loses a lot of content.
To avoid that, I can see 2 possible ways:
- change the naming so that archive_SECTION_0.html is the OLDEST archive and archive_SECTION_9.html (for example) is the freshest page, linked from the home page. In this scenario I would hope that Google spiders the site often enough to get the archives at least before the OFFSET changes...
- or, alternatively, find out why Google is not following the image-only anchor and make it spider the archive sub pages...
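The first option amounts to reversing the numbering, so that a page keeps its file name once it is full. A rough sketch, assuming the total number of archive pages in a section is known (the function and parameter names are made up for illustration):

```python
def archive_name(section, offset_from_newest, total_pages):
    """File name under the proposed scheme, where index 0 is the OLDEST page."""
    if not 0 <= offset_from_newest < total_pages:
        raise ValueError("offset out of range")
    stable_index = total_pages - 1 - offset_from_newest
    return f"archive_{section}_{stable_index}.html"

# With 10 pages in section 3, the newest page gets the highest number
print(archive_name(3, 0, 10))  # archive_3_9.html
print(archive_name(3, 9, 10))  # archive_3_0.html
```

With this numbering only the newest page changes; when an eleventh page is added, the older pages keep their names and the home page link moves to archive_3_10.html.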
I hope I explained myself clearly...
Otherwise let me know, and I'll try to improve my bad English as fast as possible... :)
TIA, again.
1) Is it normal to have a delay of 24 to 30 hours between the spidering time and the time Google shows the results?
The point is that with an OFFSET equal to 0 you get the newest archive page: this means that archive_3_0.html is always changing
google is not following the link that is at the bottom of the archive page
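Whether the image-only anchors are really the reason is not established in this thread, but it is easy to list which links on a page carry no text at all. A minimal sketch using Python's standard `html.parser`; treating a non-empty `alt` attribute as link text is an assumption of this example:

```python
from html.parser import HTMLParser

class ImageOnlyLinkFinder(HTMLParser):
    """Collect hrefs of <a> elements that have no text and no img alt text."""

    def __init__(self):
        super().__init__()
        self.image_only = []
        self._href = None       # href of the anchor currently open, if any
        self._has_text = False

    def handle_starttag(self, tag, attrs):
        attrs = dict(attrs)
        if tag == "a":
            self._href = attrs.get("href")
            self._has_text = False
        elif tag == "img" and self._href is not None and attrs.get("alt"):
            self._has_text = True

    def handle_data(self, data):
        if self._href is not None and data.strip():
            self._has_text = True

    def handle_endtag(self, tag):
        if tag == "a" and self._href is not None:
            if not self._has_text:
                self.image_only.append(self._href)
            self._href = None

# Made-up markup resembling the archive navigation described above
page = ('<a href="archive_3_1.html"><img src="right.gif"></a> '
        '<a href="archive_3_0.html">Newest</a>')
finder = ImageOnlyLinkFinder()
finder.feed(page)
print(finder.image_only)  # ['archive_3_1.html']
```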
That is my configuration for that site; it's in the <virtualhost> directive....
Besides, I wanted to ask...
why did Google leave only one page all of a sudden?
I had 45 indexed pages yesterday, today only one...
Can someone help me?
In the first message of this thread you wrote that Google started indexing your home page around June 19. That is a few days after the Esmeralda update began. That would mean that none of the 45 pages indexed until yesterday were in the full index. IIRC, all your pages had a fresh tag (date next to the URL). So most likely it was the 'fresh bot' that kept your pages in the SERPs. But the weird thing is, you only have a sub page left over, not your home page.
Soon 'fresh bot' will bring some more pages into the SERPs. Maybe you should just have some more patience until the next update, and be happy with the pages that were already shown in the SERPs for the last few weeks. After all, the site was found after the last update and has only a few inbound links.