
Apache Web Server Forum

    
mod_rewrite and stuff
Vague questions from an Apache newbie.
WibbleWobble




msg:1523003
 4:56 pm on Feb 12, 2003 (gmt 0)

Hello hello!

So, to the problem:

At the moment we're working on some optimised information pages for a website that uses a CMS, and very few of its pages are being indexed. Current CMS-based pages have URLs like so: /dir/dir2/page.ext?id=number. Now theoretically, these should be indexed, at least by Google. They're not. I suspect this is due to the redirects they have in place at the index of various areas: typing in domain.com/dir re-routes me to a page with the aforementioned directory structure. I think this is done through some mystical Apache .htaccess lark, rather than meta tags. That'd be fine, if either were in the SE databases.

Anyway, the long and short of it is we're looking to insert our pages alongside the CMS, which in reality means we need at least one link from the CMS pages; but these aren't indexed. If I were to use mod_rewrite, or some fabled ISAPI filter, to change the URL structure to, say, /page/id.ext, the pages would be much more likely to be spidered, right? This is the logic I'm working on, anyway. If we can then get a link from a newly indexed page, then all's fine and dandy, indeed?

The other options were cloaking and off-site hosting, neither of which was as preferable as trying to make the entire site spiderable. I realise this question has undoubtedly come up before, but I must desire attention or something, as I thought it worth a post. I'm just kind of seeking confirmation and guidance before I suggest a plan of action, really.

Thanks muchly.

Addendum:
If they're not running Apache, and are running IIS or something else, is there a similar tool/filter/thing available?

 

Birdman




msg:1523004
 5:25 pm on Feb 12, 2003 (gmt 0)

You can definitely make your URLs spider friendly with mod_rewrite [webmasterworld.com].

If you haven't tried yet, do a site search [searchengineworld.com] for mod_rewrite. There are lots of good threads, including the IIS issue.
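
For the URL pattern in the first post, a minimal sketch of such a rule might look like the following. It assumes Apache with mod_rewrite enabled and an .htaccess file in the document root (with AllowOverride permitting it). /dir/dir2/page.ext, the id parameter and the /page/ prefix are the placeholders from the question, not real paths.

```apache
# Sketch only: all paths below are the placeholders from the first post.
RewriteEngine On

# Internally map /page/123.ext onto /dir/dir2/page.ext?id=123.
# In .htaccess context the pattern is matched without the leading slash.
# There is no [R] flag, so visitors and spiders only ever see the short URL.
RewriteRule ^page/([0-9]+)\.ext$ /dir/dir2/page.ext?id=$1 [L]
```

Because this is an internal rewrite rather than a redirect, the old /dir/dir2/page.ext?id=number URLs keep working as before.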

WibbleWobble




msg:1523005
 1:42 pm on Feb 13, 2003 (gmt 0)

Ack, I knew I forgot something.
If I use mod_rewrite to hash out a nicer URL, will their existing CMS need to have links changed, or will rewrite take all that lark into account? Point being, their sitemap uses silly long URLs atm, and I think changing it'd be a hassle I can do without.

I shall run through the threads when I get a moment, thanks.

andreasfriedrich




msg:1523006
 3:12 pm on Feb 13, 2003 (gmt 0)

mod_rewrite [httpd.apache.org] rewrites the URIs of incoming requests. It does not postprocess anything that gets output by a CMS, so the links in the CMS pages stay as they are unless you change them. In Apache [httpd.apache.org] 2 you could write such a postprocessing filter very easily. Using Apache [httpd.apache.org] 1.3 this is a bit harder, but possible as well.
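
As a rough illustration of the postprocessing idea: later Apache 2.x releases (2.2.7 onwards) ship mod_substitute, an output filter that can rewrite text in responses. The module is not mentioned in this thread and is used here as an assumption. The paths are again the placeholders from the first post.

```apache
# Sketch only. Assumes mod_substitute is loaded (Apache 2.2.7 or later;
# on 2.4 AddOutputFilterByType also needs mod_filter) and that the CMS
# sends its HTML uncompressed, otherwise the filter cannot see the links.
AddOutputFilterByType SUBSTITUTE text/html

# Rewrite links of the form /dir/dir2/page.ext?id=123 in the outgoing HTML
# to /page/123.ext. The pattern is a regular expression; $1 is the captured id.
Substitute "s|/dir/dir2/page\.ext\?id=([0-9]+)|/page/$1.ext|"
```

A filter like this only changes the links in the pages that get sent out. The short URLs still need a matching RewriteRule so that requests for them reach the CMS.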
