I have found a number of pages on other people's sites that link to various sites of mine, but which are effectively orphaned, so Google does not crawl them. They have useful PR (in fact, one page shows PR6 on the toolbar and mine is the only outbound link on it). Of course, I want Google to find these pages and give me the PR benefit from them.
There are 7 ways I can think of. Remember, I am submitting the URLs of pages on someone else's sites, not mine:
1) Submit each page to Google (all 10 at the same time) using their addurl page.
2) Use a 3rd party multisubmitter to submit one a day for 10 days.
3) Use a 3rd party multisubmitter to submit each one in turn, 5 minutes apart.
4) Add links to these URLs on the sites that would benefit from them, so they get crawled.
5) Add links to the URLs on a single site of mine that is not connected with the sites that stand to get the benefit.
6) Use the Advanced Google Toolbar in my browser and visit these pages (I have heard this will "make" Google crawl pages you visit that they do not have in their database).
7) Use a log spammer tool to plant the URLs as referrers on the White House's web site.
Any feedback?
:-)
If that site has PR6, it is already indexed in Google. There is no use submitting it again. It will get crawled, and so probably will your site.
Things just seem too slow when you are waiting. It can take three months before a link is indexed and you can benefit from it.
But as I stated, these pages are effectively orphans. So Google will not find them by itself.
No, this is where the site has no link structure leading to these pages (the ones I want indexed). For example, they may have a robots.txt file that denies /cgi-bin/, while a CGI-generated page is actually a static HTML file with no links to it other than via /cgi-bin/, which Googlebot et al. have been told not to go down. The page is therefore outside the cgi-bin deny area, but it is not linked to from anywhere else on the site, so it is an orphan as far as Google is concerned (they do not know it exists).
I found a PR5 page with links to me. This was someone who did a page swop with me (he added a page of text and links of my design on his site, and I did the same). When we fell out over his belief that this gave him certain extra rights on one of my sites, he removed the link to that page (but not the page itself from his server). I removed both the link and the page I had put up for him.
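To make the robots.txt situation concrete, here is a rough sketch using Python's standard `urllib.robotparser`. The robots.txt content, the host `www.example.com`, and both paths are made up for illustration; they are not taken from any of the sites discussed. It only shows that a static page outside /cgi-bin/ is allowed by the rules, even when the script that links to it is not.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt of the kind described above:
# everything under /cgi-bin/ is off limits to all crawlers.
ROBOTS_TXT = """\
User-agent: *
Disallow: /cgi-bin/
"""


def is_crawlable(robots_txt: str, url: str, agent: str = "Googlebot") -> bool:
    """Return True if the robots.txt rules allow `agent` to fetch `url`."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(agent, url)


# Assumed example URLs: the CGI script that generates the page,
# and the static HTML file it produced outside /cgi-bin/.
generator_url = "http://www.example.com/cgi-bin/makepage.cgi"
static_url = "http://www.example.com/pages/partner.html"

print("CGI script allowed: ", is_crawlable(ROBOTS_TXT, generator_url))
print("Static page allowed:", is_crawlable(ROBOTS_TXT, static_url))
```

So robots.txt is not what keeps the static page out. The missing piece is simply a crawlable link pointing at it.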
Google never got to crawl that link on his site (obviously) and find that page. But, as that page is still there, I thought I would like to help Google find it :-)
If you want Google to crawl the page then by far the quickest and most efficient way of doing it is to link to the page in question from your own site. The only thing that will stop Google indexing the page is a noindex META tag or denial in the robots.txt of the site. Remember though that the real PR of the page is calculated on incoming links that Google knows about, so even though the toolbar guesses the PR at 6, it may be only 1 or 2 with your single incoming link.
John_Caius
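For anyone who wants to check the noindex point before bothering to link to a page, a minimal sketch with Python's standard `html.parser` might look like the following. The sample HTML strings are invented, and the choice to look at both `robots` and `googlebot` meta names is my assumption about what is worth checking, not a statement of how Google processes pages.

```python
from html.parser import HTMLParser


class RobotsMetaParser(HTMLParser):
    """Collect directives from <meta name="robots"> style tags."""

    def __init__(self):
        super().__init__()
        self.directives = set()

    def handle_starttag(self, tag, attrs):
        if tag != "meta":
            return
        # Attribute values can be None (e.g. <meta name>), so default to "".
        attr = {name: (value or "") for name, value in attrs}
        if attr.get("name", "").lower() in ("robots", "googlebot"):
            for directive in attr.get("content", "").split(","):
                self.directives.add(directive.strip().lower())


def has_noindex(html: str) -> bool:
    """Return True if the page carries a noindex robots META directive."""
    parser = RobotsMetaParser()
    parser.feed(html)
    return "noindex" in parser.directives


# Invented sample pages for illustration.
blocked_page = '<html><head><META NAME="ROBOTS" CONTENT="NOINDEX, FOLLOW"></head></html>'
open_page = '<html><head><meta name="description" content="links"></head></html>'

print(has_noindex(blocked_page))
print(has_noindex(open_page))
```

In practice you would feed it the HTML fetched from the page in question. If it reports noindex, a link from your own site will not get that page into the index.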
GG has stated they can crawl anything, including cgi, php, etc. But clearly it is often not allowed to do so for one reason or another. That does not stop it from giving PR to a page that it has not previously indexed. If it assumes a PR, then it must accept, in principle, that this level of PR is due to that page solely on the basis that it is a page on that site. PR gets to a page through inbound links and internal site links. But a page does not need inbound links to be given PR from the mother site. And if there are no internal links, G still gives it PR just in case.
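On the question of how PR depends on the links Google knows about, here is a toy sketch of the simplified PageRank iteration as publicly described (damping factor 0.85). It is only an illustration under assumptions: the link graph and page names are invented, and the numbers Google actually computes, or how the toolbar maps them to a 0-10 scale, are not public.

```python
def pagerank(links, damping=0.85, iterations=50):
    """Simplified PageRank by repeated iteration over a small link graph.

    `links` maps each page to the list of pages it links to.
    """
    pages = sorted(set(links) | {t for targets in links.values() for t in targets})
    n = len(pages)
    rank = {page: 1.0 / n for page in pages}
    for _ in range(iterations):
        new_rank = {page: (1.0 - damping) / n for page in pages}
        for page in pages:
            targets = links.get(page, [])
            if targets:
                # A page splits its rank evenly among the pages it links to.
                share = damping * rank[page] / len(targets)
                for target in targets:
                    new_rank[target] += share
            else:
                # A page with no outbound links spreads its rank over all pages.
                share = damping * rank[page] / n
                for other in pages:
                    new_rank[other] += share
        rank = new_rank
    return rank


# Invented graph: "home" is well linked, while "orphan" is known
# only through a single link from "my_site".
links = {
    "home": ["a", "b", "c"],
    "a": ["home"],
    "b": ["home"],
    "c": ["home"],
    "my_site": ["orphan", "home"],
    "orphan": ["my_site"],
}

ranks = pagerank(links)
for page in sorted(ranks, key=ranks.get, reverse=True):
    print(f"{page:8s} {ranks[page]:.4f}")
```

In this toy graph the page with one known inbound link ends up far below the well-linked page, which fits the earlier point that a toolbar guess of 6 may not survive once the real link count is used.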
Anyway, I still would like to know if people have had success with getting Google to crawl pages using the submit URL form either directly or with a 3rd party submission tool.
Any ideas how to get the real PR of a page?