Forum Moderators: open

Message Too Old, No Replies

Translating jsp to html

so the spiders will like me...

         

deft_spyder

8:45 pm on Dec 18, 2002 (gmt 0)

10+ Year Member



One aspect of my site is a doctor finder that is completely jsp/databse driven. I've been working with my buddy to find a script that will export a copy of our site to html. I will make a master doctor page for the spiders that will then link to every doctors page in html. We'll set this up to export a new doctor page every time a doctor updates it in his edit page.

Has anyone had any luck making or finding a nice script for making this happen? jsp is what we are working with now.

pageoneresults

9:01 pm on Dec 18, 2002 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member Top Contributors Of The Month



Hello deft_spyder, welcome to WebmasterWorld!

I did some quick searching and came up with one item that appears to be an answer to your situation. I've never worked with jsp pages and don't see many in the serps (Search Engine Results Pages). From what I've read in the past, the technology has not really caught on. It appears that php and asp are the two predominant scripting languages.

Do a search in Google for...

javax.swing.text.html.HTMLDocument

agerhart

9:19 pm on Dec 18, 2002 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



I have worked with a few JSP sites and have had no problems within the SEs thus far. Make sure that your URLs are not too long and they do not contain special characters (?,&,@).

deft_spyder

9:20 pm on Dec 18, 2002 (gmt 0)

10+ Year Member



well, yeah, its not on alot of SE's, but we did code alot of barnesandnoble.com and pepsi, talbots, and a few others with it.

Thankyou so much for finding that, and I'll pass it on asap. I try to take as much load off my programmer as I can as he's a friend with kids, and anyone with kids knows.... they just seem to consume time.... :)

andreasfriedrich

9:33 pm on Dec 18, 2002 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



I admit that Iīm a bit baffled by this thread. You say that you have a jsp/database driven site and that you need to convert jsp to html for the SE spiders.

Unless your site is totally useless right now, there must be some kind of interface to the UAs. For how are surfers accessing your site? I believe they get servered HTML documents since that is what browsers will understand. Thatīs all you need for the SEs as well. There is no fundamental difference between browsers and SE spiders.

Andreas

deft_spyder

10:31 pm on Dec 18, 2002 (gmt 0)

10+ Year Member



let me unbaffle.

there is a section of the site called the doctor finder. It has a search by name/state/city. The final page is a jsp driven dynamic page with a parameter in the header that gives that specific docs info.

The final URL through that finder is: [mysitename.com...]

Maybe im being too cautious? Perhaps a search engine will use that raw url and index it? I wanted to stay away from giving the search engine a URL that looks like that.

Am I ok? if thats the case, I'll just geenrate a page that has every doctor link on it, as a "doctor master list/site map".

bcc1234

11:12 pm on Dec 18, 2002 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



Check out the site in my profile. Eveything is built on JSP.
Although, I left search as ...jsp?query= ... for the sake of no ruining the look completely, the rest looks like HTML.

What you need is a servlet that would match a pattern and then would call an appropriate page using RequestDispatcher.forward.

Much more flexible than mod_rewrite.

xbase234

12:42 am on Dec 19, 2002 (gmt 0)

10+ Year Member



Copying a whole dynamic site to HTML sounds like a real pain.

Have you done everything you can to get your pages spidered? Google will spider and index JSP.

ppg

1:22 am on Dec 19, 2002 (gmt 0)

10+ Year Member



deft_spyder: From my experience of google and jsp/database driven pages, if you only have one parameter in the querystring then the page will get indexed, any more and it won't.

If all your URLS are like the one you posted above, I would have thought that the links on your doctor master list page would be followed ok, at least by googlebot. I don't know about any other engines.

a servlet that would match a pattern and then would call an appropriate page using RequestDispatcher.forward
Thats a really nice idea!

andreasfriedrich

1:27 am on Dec 19, 2002 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



My point was that the spider does not care at all what kind of technology you use to serve a html document as long as you serve one.

The only thing that might prevent a spider from spidering your site is the form of the URI. You wonīt have to worry a single bit if you use static URIs. The more parameter you have in your query string the less like you will get spidered. One parameter seems to be ok though.

Andreas