homepage Welcome to WebmasterWorld Guest from 23.20.63.27
register, free tools, login, search, pro membership, help, library, announcements, recent posts, open posts,
Pubcon Platinum Sponsor 2014
Home / Forums Index / Google / Google News Archive
Forum Library, Charter, Moderator: open

Google News Archive Forum

This 35 message thread spans 2 pages: 35 ( [1] 2 > >     
google, most pages not showing title/summary
rlkanter




msg:122459
 9:24 pm on Oct 31, 2004 (gmt 0)

I'm not sure if this is the correct forum for this, and I'm not entirely sure how to explain this. If I do a search on google for:

site:domain.com

It shows something like 1700 results, but most of them only show a single line such as:

domain.com/item/display-item.php?item=5341
Similar pages

Each line has a different item number

Also if I show results in English only it only shows around 400 results.

Other search engines do seem to index them correctly, and the pages themselves display correct as far as I know. Any hints? Thanks :)

[edit to clarify display]

 

rlkanter




msg:122460
 8:33 pm on Nov 9, 2004 (gmt 0)

Now google shows 2.7k pages and even less show up for "english results only".

jdMorgan




msg:122461
 9:14 pm on Nov 9, 2004 (gmt 0)

Is this a new site < 3 months old?
Are these pages "Disallowed" in your robots.txt file?
Does each page return a correct 200-OK server response?

Google lists a "URL-only" link if it finds a link to the page, but has not yet crawled the page to determine its contents.

Google lists a "URL-only" link if it finds a link to the page, but cannot crawl the page because Googlebot is Disallowed from crawling that page by directives in your robots.txt file.

You could also possibly confuse Googlebot if the page returns an unexpected server response code. Use the Server Headers chacker in your WebmasterWorld Control Panel to test, and make sure you get a 200-OK response.

Jim

siteseo




msg:122462
 9:17 pm on Nov 9, 2004 (gmt 0)

This could be caused by any number of factors, but your chief concern is duplicate content. G will reduce it's index of your page to the title tag and URL if it detects a certain amount of duplicate content on the page, be it content duplicated on other pages of your site, or, more importantly, content taken from someone else's site. Eliminate the dupe content and you should be alright.

rlkanter




msg:122463
 10:50 pm on Nov 9, 2004 (gmt 0)

I checked the headers with the utility and it gave 200 result.

Nothing in the robots.txt is disallowed.

The site is about 6 months old, but a majority of the pages aren't indexed.

With the exception of links on the left side of the page, all the pages are different.

tr95




msg:122464
 11:23 pm on Nov 17, 2004 (gmt 0)

I have this problem with some pages that contain no duplicate content -- unless a spammer has duplicated them unbeknownst to me. One page in particular ranked at or near the top of its narrow category until about August 2004, when Google suddenly demoted it to title and URL only. There is nothing else quite like the page and nothing nearly as good in the current Google results. The page isn't in Yahoo or MSN either, but I don't believe it ever was. Consequently, very few people are finding it now.

Should I change the URL and title? It has been my practice never to change URLs to avoid breaking links and bookmarks, but I would try anything to get this page back into the Google cache.

zeus




msg:122465
 11:41 pm on Nov 17, 2004 (gmt 0)

many sites has this problem from about 2 nov. and I think this is also the reason why many sites has fallen in the ranking, because there pages is not fully valued.

P.s and this is the reason why Im not in Las vegas

benjerz1982




msg:122466
 11:58 pm on Nov 17, 2004 (gmt 0)

Hi,

Could someone speculate on why my site has an URL only listing for the homepage (which gets spidered a lot) but lesser pages on the same domain are still there?

Also when doing a list of all pages in google on that domain the results are long ago deleted (3 years!)

Could this be because there were connectivity problems last time google came calling?

Thanks in advance

nsqlg




msg:122467
 12:16 am on Nov 18, 2004 (gmt 0)

For me too, I have one site with more than 70% in limbo (partially indexed), dont is duplicate why in my language is very rare spammers, PR5, 5 years old. I'm hate this G "bug", keep only URL is worst.

Spine




msg:122468
 12:22 am on Nov 18, 2004 (gmt 0)

I've got long deleted pages showing too.

Get your s*&t together google.

bucaro




msg:122469
 12:34 am on Nov 18, 2004 (gmt 0)

The same problem happens to me.

Home page is not listed and when it does only url is listed. All other pages are listed fully.

I search for a unique phrase in my website, which has been there for 2 years, now I several competing sites with it.

Can't google tell how had it first?

This is a terrible bug.

Lorel




msg:122470
 11:46 pm on Nov 18, 2004 (gmt 0)

Your site may have been hijacked. To find out, Search google for the following:

inurl:www.domain.com

Look for URLs with "tracker2" included in the url with a redirect to your own site similar to the following:

www.hijacker'sdomain.com/tracker2.php?url=http://www.innocentvictum'sdomain.com/subpagegoeshere.html

Then look for any other URL that has a redirect hidden in the title of the page i.e., FILE NOT FOUND, which redirects from their page to yours.

Sometimes your own page title will have someone else's URl in the link so run your mouse over every url and watch the bottom of your browser and see where it goes. If you suspect foul play click on the link to see what happens. (don't click on your own PayPerClick links however).

See this thread for more info:
[webmasterworld.com...]

sasha




msg:122471
 2:19 am on Nov 19, 2004 (gmt 0)

I have 11,000 URLs that show no title and no description. Have all of them been hijacked?

This is not a new site and Google had previously indexed some of those URLs.

There is also no "duplicate content" (whatever that means!) each page has unique on-page data, meta tags, alt-tags, etc.

It's either a bug, or a PR stunt by Google: We spidered 9 zillion URLs! (yes, that's true, but only half of those URLs are fully indexed).

Lorel




msg:122472
 2:35 am on Nov 19, 2004 (gmt 0)


I have 11,000 URLs that show no title and no description. Have all of them been hijacked?

If they have been hijacked the results from you inurl:www.domain.com search should have one of the symptoms mentioned above or possibly some other kind of redirects. If those aren't in evidence then it may be that you have done something that google considers spam or another problem.

sasha




msg:122473
 3:16 am on Nov 19, 2004 (gmt 0)

but of course my website is spam!

Has it ever occurred to you that the problem may be with Google?

Seo1




msg:122474
 3:19 am on Nov 19, 2004 (gmt 0)

For those of you finding old pages from your site remember google is a database indexing search engine meaning the results of its crawls are stored in their database and can show pages back as far as when they started.

Clint

LowLevel




msg:122475
 3:34 am on Nov 19, 2004 (gmt 0)


There is also no "duplicate content" (whatever that means!) each page has unique on-page data

In percentage, how much unique visible text has each page, compared to non-unique text (menus, common headings or titles, etc.)?

bucaro




msg:122476
 4:33 am on Nov 19, 2004 (gmt 0)

I did the inurl and found some pages linking to me with a redirect script. Google lists them with my title and desc. So can I do next?

The also list a lot of pages long removed from my site.

sasha




msg:122477
 4:44 am on Nov 19, 2004 (gmt 0)

> In percentage, how much unique visible text has each page, compared to non-unique text (menus, common headings or titles, etc.)?

I would say about 20% is unique.

Our site is a directory.

Each directory member has a page containing their name, address, phone, email.

How much of the visible text that you see on yellowpages or citysearch is unique in percentage terms? I would say, probably the same if not less.

Lorel




msg:122478
 11:45 am on Nov 19, 2004 (gmt 0)

Hi Bucaro,


I did the inurl and found some pages linking to me with a redirect script. Google lists them with my title and desc. So can I do next?

Gather domain name data off of
[internicdomainnames.net...]
Make note of the contact email of the registrant and or administrator and send the owner a letter asking them to please remove the redirect. If you don't hear back from them or the letter bounces then try and get the contact email off the offending web site (if there is any) and write again. If this email contact bounces also then contact the hosting company and write them. The hosting company is listed at the bottom of the domain name data. If there are only IP numbers use this site to gain more info:

[freeality.com...]
scroll down to domain name search and input the IP address into:
EONS reverse IP address and it will give you the name and contact data of the hosting Co.

Then search the hosting companies web site for their TOS (terms of service) or policy statement or rules for their clients (which always contain rules about copyright infringement). Quote those rules and point out the ones that their client is breaking. Also include their domain name data and a copy of the bounced email. Ask them to let you know how they plan to handle this and when it should be resolved as you will go higher up the chain of command if they don't respond. They should respond quickly because they could lose their license if they let this continue.


The also list a lot of pages long removed from my site.

You might enter those into Google's add/remove url.

cwnet




msg:122479
 12:13 pm on Nov 19, 2004 (gmt 0)

May someone enlighten me, please!

Using the inurl: command I found the following link

www.eWidgetdirectory.co.uk/includes/go.php?goto=http://www.MySite.net&pos=17&keyword=Widget%20CityName

clicking on it opens my index page

The people behind it operate several sites covering different widgets and thousands of links like the above.

Whats the use? I mean, really, I just dont get it...

<owner edit> oh yeah, just found out...a goddamn link exchange directory...if a mod comes by, feel free to delete this post</owner edit>

petehall




msg:122480
 12:28 pm on Nov 19, 2004 (gmt 0)

Spine: I've got long deleted pages showing too. Get your s*&t together google.

I have been seeing this for quite some time now.

It's odd, I'll give them that. I'm seeing pages which are years gone...

Lorel




msg:122481
 6:48 pm on Nov 19, 2004 (gmt 0)

May someone enlighten me, please!

Using the inurl: command I found the following link

www.eWidgetdirectory.co.uk/includes/go.php?goto=http://www.MySite.net&pos=17&keyword=Widget%20CityName

clicking on it opens my index page

Yes, but it goes to their site first and then yours, and thus Google attributes the PR to the first site, i.e., they steal your PR with this link.


The people behind it operate several sites covering different widgets and thousands of links like the above.

Whats the use? I mean, really, I just dont get it...

<owner edit> oh yeah, just found out...a goddamn link exchange directory...if a mod comes by, feel free to delete this post</owner edit>

If this is a link exchange directory you didn't sign up with ask them to remove your link as it may be stealing your PR. I would ask to have it removed even if you signed up with them.

I'm getting real pickey with sites I link to now. I check the links on their link pages and if they have any redirect code, similar to the above, I don't sign up.

However, dishonest hijackers appear to be hiding their code so you may not be able to detect it beforehand.

mcrystal




msg:122482
 8:50 am on Nov 20, 2004 (gmt 0)

Hi,

I have experienced similar things with only my URL showing up in Google. I did the inurl search, and many of the results that showed up in addition to my own pages were of this format:

[excite.co.jp...]

Does anyone know what this is about? If it is a problem, any suggestions for how to deal with it?

Thank you very much,
Matt

downsideup




msg:122483
 6:23 pm on Nov 20, 2004 (gmt 0)

My sitesí pages were slowly disappearing since July this year from the google indexed. I was very perplexed as to why all of a sudden my google traffic has dropped so much. At first I thought it was google duplicate content algo, but then I discovered all of my sites are hijacked.

I did a site:www.mysite.com and inurl:www.mysite.com google's index shows all the pages are now in the google supplemental index. Some of the urls have the tracker2.php and other with redirects with some of them even having the hijackerís site url as my siteís title.

One of my used to have PR8 and now itís Pr6 and the rest are pr4-5.

My google traffic are practically all gone.

I need advise as to how to resolve this situation.

www.the-widget-masters.com/tracker2.php?url=http://www.mysite.com%2mypage_B.asp

[widget.com...]

www.widget.com/Goto/GotoWebPage?gotoUrl=http://www.mysite.com%2page.asp&gotoType=SE&SrchItem=3

zeus




msg:122484
 8:08 pm on Nov 20, 2004 (gmt 0)

I have the same problem, go to the whois, dont contact the owner go directly to th host info and send them a email.

I hope the best for you

downsideup




msg:122485
 1:24 am on Nov 21, 2004 (gmt 0)

Zeus,

I will try to contact the host, but just curious did you have any success in having the host banning their site?

Was your sites got reindexed by google and achieved the previous rank?

Thanks,
Downsidup

chrisk999




msg:122486
 2:05 am on Nov 21, 2004 (gmt 0)

My site (PR7) that has been going strong since 1998 has only got url/'similar pages' too. Very odd - googlebot hasn't done a full visit for a while now, and it used to eat up the entire site once per day. There's no cache available for 99% of the pages.

The PR6/7 still appears on the toolbar, so no ban. Very strange.

This odd behaviour started when the site went offline for 4 days due to a technical problem last month (and had to switch IPs as a result). Could this be the cause?

Lorel




msg:122487
 2:26 pm on Nov 21, 2004 (gmt 0)


I will try to contact the host, but just curious did you have any success in having the host banning their site?

I had very quick success with one host--they threatened to remove the client's web site within 24 hours if the offending link wasn't removed. And within 2 days that link dissappeared from my inurl: search.

However, with the tracker2 hijacker/host that I believe you mentioned above--I've had no luck and this co appears to have hijacked thousands. Their host hasn't responded and I've written twice. It's a co in British Columbia. Time to go higher up the tree.

DerekH




msg:122488
 4:19 pm on Nov 21, 2004 (gmt 0)

chrisk999 wrote
This odd behaviour started when the site went offline for 4 days due to a technical problem last month (and had to switch IPs as a result). Could this be the cause?

I suspect not - two of my sites have almost vanished now...
One 17 page site only has 2 indexed pages now, and one 325 page site only has 4 indexed pages. Curiously, all four of those indexed pages are still being crawled, and the cache date is updating. But why the rest have gone beats me - they've been there for four years...
DerekH

This 35 message thread spans 2 pages: 35 ( [1] 2 > >
Global Options:
 top home search open messages active posts  
 

Home / Forums Index / Google / Google News Archive
rss feed

All trademarks and copyrights held by respective owners. Member comments are owned by the poster.
Home ¦ Free Tools ¦ Terms of Service ¦ Privacy Policy ¦ Report Problem ¦ About ¦ Library ¦ Newsletter
WebmasterWorld is a Developer Shed Community owned by Jim Boykin.
© Webmaster World 1996-2014 all rights reserved