Forum Moderators: open
User-Agent: Googlebot
Disallow: /*.swf$
The swf files are still indexed though...is there some kind of command I can give to google to drop these files from the index? Thanks!
Try adding this to your HTML file :
<META NAME="ROBOTS" CONTENT="NOINDEX, NOFOLLOW">
INDEX directive specifies if an indexing robot should index the page. The FOLLOW directive specifies if a robot is to follow links on the page.
Google also says in its FAQ : "In order to save bandwidth Googlebot only downloads the robots.txt file once a day or whenever we have fetched many pages from the server. So, it may take a while for Googlebot to learn of any changes that might have been made to your robots.txt file. Also, Googlebot is distributed on several machines. Each of these keeps its own record of your robots.txt file."
The swf files are still indexed though...is there some kind of command I can give to google to drop these files from the index?
From Google FAQ :
<META NAME="ROBOTS" CONTENT="NOARCHIVE">
This tag will tell robots not to archive the page. Google will continue to index and follow links from the page, but will not present cached material to users.
Removing a page from Google's index :
".... Google's policy for removing a page from our index requires that we obtain the permission of that page's webmaster .... we will remove the offending page from our index. For more information on this process, please see [google.com...] "
You can get more details about all this and more here :
Google FAQ's [google.com]
because of the way the flash sites are built I can't put them in another directory
If you can rename all your .swf files, there is a solution.
Let all your .swf filenames start with the same letters (for example: "swf", so "intro.swf" would become "swf-intro.swf").
Now you can add these lines to your robots.txt file:
User-Agent: Googlebot
Disallow: /swf
Make sure that only .swf files start with that letters...