
General Search Engine Marketing Issues Forum

Spidering Embedded PDFs
Which URL will Google show: the HTML file or its embedded PDF file?

WebmasterWorld Senior Member 10+ Year Member

Msg#: 4215880 posted 9:17 pm on Oct 12, 2010 (gmt 0)

I have HTML pages that use the Google PDF Viewer (http://googlesystem.blogspot.com/2009/09/embeddable-google-document-viewer.html) to display PDF files from my server.

1) html-page-1.html (with ads, logos, etc.)
2) -> download link: pdf-file-1.pdf
3) -> also iframed: pdf-file-1.pdf
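
For readers trying to picture this setup, here is a rough sketch of the markup such a container page might carry. It is not the poster's actual code. The viewer endpoint and its `url` / `embedded` parameters are taken from how the embeddable viewer is commonly described and should be treated as assumptions, as should the `example.com` host, the helper names, and the iframe dimensions.

```python
from html import escape
from urllib.parse import urlencode

# Assumed viewer endpoint; verify against the viewer's own documentation.
VIEWER_ENDPOINT = "https://docs.google.com/viewer"


def viewer_url(pdf_url: str) -> str:
    """Build the viewer URL that the iframe would point at (hypothetical helper)."""
    query = urlencode({"url": pdf_url, "embedded": "true"})
    return f"{VIEWER_ENDPOINT}?{query}"


def container_snippet(pdf_url: str) -> str:
    """Return the two elements described above: a download link and an iframe."""
    safe_pdf = escape(pdf_url, quote=True)
    safe_viewer = escape(viewer_url(pdf_url), quote=True)
    return (
        f'<a href="{safe_pdf}">Download PDF</a>\n'
        f'<iframe src="{safe_viewer}" width="600" height="780"></iframe>'
    )


if __name__ == "__main__":
    # Hypothetical PDF location on the poster's server.
    print(container_snippet("http://www.example.com/pdf-file-1.pdf"))
```

Note that in a setup like this the iframe's `src` is the viewer URL, with the PDF address passed as a query parameter, while the download link points at the PDF directly. That difference may matter for the questions below.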

Here are my questions:

1) When Googlebot visits, will it spider BOTH the HTML container page AND the iframed/linked PDF file?

2) When that content appears in a Google search result, will the result link to the HTML page, to the PDF file, or to both as two separate results?

In other words, does Google see the parent page as THE page, with the iframed content considered as part of that page, or does Google consider the parent page and the iframed content to be separate pages?

3) Will it even spider the iframe content at all?

I MUST get Google to index the PDF files, because the HTML container pages don't have any content except for ads, logos and site navigation links.

That being said, my goal is to have Google ONLY display links to the HTML pages, never to the embedded PDF files.

Thanks in advance.


WebmasterWorld is a Developer Shed Community owned by Jim Boykin.
© Webmaster World 1996-2014 all rights reserved