Forum Moderators: phranque

Message Too Old, No Replies

Generating HTML pages from ASP

Dicussing the possibility of generating HTML pages from ASP pages for SEO

         

karloszantana

8:38 am on Jan 5, 2006 (gmt 0)

10+ Year Member



Does anyone out there know if there is a program that mass generates html code from ASP? I know its possible but I don't know how. I know browsers read finished ASP's like they would an HTML page but I want all the generated ASP pages to be in HTML so that spiders can read keywords on the page. I think that in terms of SEO it would be a good idea. If anyone has an idea of how this works please reply ASAP. Thanks.

Easy_Coder

8:43 pm on Jan 5, 2006 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



your asp is already output as html so those keywords will be picked up.

karloszantana

5:52 am on Jan 8, 2006 (gmt 0)

10+ Year Member



Thanks for the reply. Spiders have a tough time reading pages generated through ASP. Since the information to be indexed has to be called from a database, there is no way for the spider to enter and search the database. Spdiers have a problem reading links from ASP too. Its easier for a spider to read www.mydomainname.com/content150.html than it is to read www.mydomainname.com/content.asp?page_id=150
Spiders have a problem reading question marks. They are a lot more comfortable visiting html links. This is one of the basic concepts of SEO. So, once again, if you know a way to generate html from ASP I'd love to hear it.

JAB Creations

8:34 am on Jan 8, 2006 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



The only way to effect SEO is to effect what HTML search engines find when they visit your site. So in effect if you compare serving ASP, PHP, or not using any serverside languages at all to serve the exact same content you will only see various extensions on files that are no indexes and you should not see any differences between SERPS/SEO based solely on changing the extensions on files.

If you want to increase your SEO/SERPS then you need to work on the (X)HTML you're serving. If you're using serverside languages then you simply need to work on the (X)HTML within the serverside code (but not necessarily the serverside code itself).

To the point I suppose: Using ASP versus not/something else will not change SEO; changing the (X)HTML code will (regardless of what you use to create/serve).

karloszantana

9:04 am on Jan 8, 2006 (gmt 0)

10+ Year Member



Thanks for the info. I totally agree with you. I just took over a website that was built without search engines in mind, i.e. lots of swf files and no meta tags or rich content on the site to search. The only content on the site is generated using ASP's.

My initial plan is to change the index file to a content rich page and introduce meta's and alt tags wherever I can and a robot instruction file.

I'm still curious though about the severside pages. If its possible I'd like to change them all into html but with so much content and constant updates it might not be feasible. Thats why I thought if it was possible I'd find a process that generates pure html pages from ASP's. I guess in the end though I'll still be stuck with the ASP's and will have to work on the html part of the pages.

Oh well. Back to the drawing board.

sharbel

2:26 pm on Jan 8, 2006 (gmt 0)

10+ Year Member



You are 100% wrong in saying that search engines have a tough time with .asp/php/aspx/jsp pages. I have several asp.net sites that are ranked #1 in all engines..

Easy_Coder

4:51 pm on Jan 9, 2006 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



I've never had any trouble whatsoever with spiders grabbing dynamic asp pages and I've been using it since 1998.

karloszantana

6:16 am on Jan 12, 2006 (gmt 0)

10+ Year Member



From what I have been reading spiders will not traverse a database for information or pick up on pages generated on the fly. ASP's that come up during a search are supposedly a result of luck and are haphazard in maintainnig a renaking.
There are several articles about this topic if you search for it. This is one of the sites I found today but there are a lot more. [library.albany.edu...]

physics

6:59 am on Jan 12, 2006 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member




Since the information to be indexed has to be called from a database, there is no way for the spider to enter and search the database.

This isn't true at all. The 'job' of your asp code is to draw the data out of the database and serve it up as HTML code. The SE spider or user's browser never has to directly query or search your database.
The issue of dynamic vs. static pages is an old debate, but there's no doubt that many .php/.jsp/.asp, etc pages rank very well in the search engines. In any case it's 2006 now and the major SEs can handle .asp as long as you don't have obnoxiously long URLs or duplicate page problems due to session id strings in the URL.
If you want .html to be the extension for your ASP page you can get isapi rewrite if you're running IIS or use mod_rewrite if you're on Apache. Then you have dynamically served ASP that a search engine spiders think is a static .html page, for what that's worth nowadays.

karloszantana

7:24 am on Jan 12, 2006 (gmt 0)

10+ Year Member



Thanks Physics. That was helpful. Its not that I don't think ASP's can be indexed, I just think its easier for html extensions. I will look into rewriting the extensions. Much appreciated.

Easy_Coder

11:39 am on Jan 12, 2006 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



If you want .html to be the extension for your ASP page you can get isapi rewrite if you're running IIS or use mod_rewrite if you're on Apache.

You don't even need isapi re-write on iis... you can have iis process .htm and .html pages as if they are ASP if that helps you.

karloszantana

11:59 am on Jan 12, 2006 (gmt 0)

10+ Year Member



Do you mean process ASP pages as .htm and .html? If so how do you get IIS to do this? I was looking for a solution like isapi rewrite but if I can do it myself then why not?

physics

5:47 pm on Jan 12, 2006 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



ASP files can be indexed. Try inurl:.asp on Google.
I get half a billion pages.

physics

5:52 pm on Jan 12, 2006 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



Easy_Coder is right from what I know. But this has to be done at the admin level. Research mime types on IIS.

sharbel

1:35 pm on Jan 13, 2006 (gmt 0)

10+ Year Member



Again, you are 100% wrong if you think that indexes have trouble indexing .asp/.aspx/.jsp/.php pages. 100% wrong...

However, if you insist on using .htm, you can certainly click the Configuration button for the website in the IIS MMC and assign the ASP DLL to the .htm extension, which would assign ASP to pre-process your .htm files just like .asp pages.

The extension is not your problem though. If your URLs are [widgetworld.com...] *that* is likely what people are talking about not getting good *RANK* (ie Supplemental listings). You will still be *INDEXED* just fine but a lot of the engines will not give you good rank (Google likes to shove these in the Supplemental listings I notice).

If you have all that querystring information in your URL, you will get the same result whether it's an .asp extension or .htm. Again, the extension is irrelevant here.

If your page in question is [widgetworld.com...] or [widgetworld.com...] the *on page* optimization as well as the *off page* optimization is going to determine your ranking (assuming all things equal with domain age etc etc).

HTHs

celgins

1:53 pm on Jan 13, 2006 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member Top Contributors Of The Month



Karloszantana.... I can't think of an easy way to mass generate HTML from ASP and I agree with with Sharbel on this one. Your .ASP pages will be indexed just fine.

But it sounds to me like your concern is with the dynamic HTML that doesn't exist until a user triggers it? (i.e. form submission, database search, etc.)

MrSpeed

5:34 pm on Jan 13, 2006 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



sharbel is right.... 100% right :)

I'm curious. What is the current problem with the site? Is it not getting indexed or not ranking well? Maybe there are some very basic issues with your SEO that can be improved.

Easy_Coder

9:54 pm on Jan 13, 2006 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



You can put ASP code in your HTM and HTML files and then instruct IIS to process all .htm and .html files using the asp.dll.

Here's how...

1) Launch your IIS Manager
2) Locate your website in the site tree
3) Right click on your site and select properties
4) Click on the home directory tab
5) In the Application Settings area of the home directory tab click the configuration button
6) If the .htm and .html extensions are not on the list then click 'add' otherwise choose edit
7) You'll have to browse to the asp.dll (windows\system32\inetsrv\asp.dll (select it)
8) enter .htm into the Extension field.
9) Limit verbs to GET, HEAD and POST
10) click ok and repeat steps 6 - 9 for .html
11) click ok

your done... but this really isn't necessary becase asp pages dynamic or not get indexed. If you still want to do this at least now you know how.

karloszantana

12:49 pm on Jan 15, 2006 (gmt 0)

10+ Year Member



Thanks for the feedback guys. I have no doubt that asp pages get indexed. It is the ranking I'm worried about. I want to genereate pages on the fly just because I have a huge database to handle; but if I could output them as static html files I figure that the info on those pages would be "ranked" higher after indexing than generated asp pages. I also want to start a reciprocal link campaign and (for dummies) its visually easier to read an html link than an asp one.
There is a website here in the middle east called www.ameinfo.com They have a system setup similar to what I have in mind and any search using a keyword with any relation to business or to the location they're in results in a top ten ranking.
Thanks for the information Easy_Coder, Sharbel and everyone else. Right now I need to conecntrate on getting more links to my site so I can please the Google search algorithm. ;) Cheers guys.

kevhead

3:13 pm on Jan 26, 2006 (gmt 0)

10+ Year Member



Karloz,

You are actually correct. During the early years of dynamic web sites, search engines did NOT like .asp and similar database-driven pages. The search engines adapted, briefly, and made changes to index dynamic pages better. However, in the past 18 months, there has been a shift to place more of an emphasis on content located at the top of the domain. For instance, if you have a dynamic site, and your default home page is default.asp, then more emphasis for ranking your site will be based on what's on that page as opposed to sub pages (ie. default.asp?page=products). Certain search engines ignore all data in a URL once the spider sees a?,= or &.

Does that make sense?

I've built a content management system using ASP and SQL that generates dyanmically named ASP pages, which helps. I'm very close to having it auto-build HTML from the dynamic pages, which is why I stumbled onto this thread.

If you are concerned with SEO for your site, while having a dynamic site is not a death knell, HTML will fare much better, at least right now.

kevhead

3:22 pm on Jan 26, 2006 (gmt 0)

10+ Year Member



Sorry to reply to my own post, but check this out:

<snip>

[edited by: trillianjedi at 7:08 pm (utc) on Feb. 1, 2006]
[edit reason] Please see TOS #13 - thanks ;-) [/edit]

leapforward

4:01 pm on Jan 26, 2006 (gmt 0)

10+ Year Member



It is definitely possible to get dynamically generated ASP web pages, that represent hundreds of thousands of rows in the database, crawled and ranking well.

I have personally built database driven ASP pages that are massively crawled (10K to 25K pages per day) by all of the major search engines. Also, many of these dynamic ASP pages have come up very high in search engine rankings.

You seem interested in the problems with URLs. I used keyword rich URLs for each item in the database and a modified 404 error page to display all of the content.

The keyword rich URLs look something like this:
[domainname.com...]

When a request comes in to this URL, my 404 page handler parses the URL and determines which product it matches in the DB and if I get a match, I return a product page that doesn't look like a 404.

The search engines still need to be able to find each dynamically generated ASP page. This is true no matter even if the pages are static, cgi, php, etc. Use the tips and techniques described in other topics on this site if you want to help get your pages indexed.

Remember: Keywords in the URL are only 1 part of SEO. A smart man wrote 26 SEO steps 4 years ago that are still quite valid today. [searchengineworld.com...]

sharbel

9:07 pm on Jan 29, 2006 (gmt 0)

10+ Year Member



Just because a site is 'dynamic' doesnt mean it will have problems with search engines. If your URL is www.mydomain.com/something.aspx?id=293&fid=29&x=2&y=2&page=292 you will have trouble.. but if your URL is www.comain.com/something/widgets/blue-widget.aspx you aren't going to have problems..

Please don't make blanket statements like "If you have pages other than .html" you will have trouble, because it's just flat out wrong. Granted, if you dont know what the heck you are doing, you will have trouble. I would suggest if you don't know what you are doing, you will have equal trouble with your flat .html files too :)

karloszantana

7:53 am on Jan 30, 2006 (gmt 0)

10+ Year Member



Sorry Sharbel. I respect that you have your opinion but from my research and experience, I have formulated my own opinion that most search engines, not just google, prefer static html to databas driven pages.
I agree with you totally that it is a bad idea to have URLS with syntax containing command parmaeters like (?,%,&, etc) but I believe further that spiders have preferences to the hyper text markup language than server driven pages.
Unless you've met with the algorithm designers from Google, I think its wrong to suggest that you have all the answers.
I started this thread to discuss a topic and share opinions, not to convince the world of who is right and wrong. So chill.You shouldn't be making blanket statements like "you're completely wrong"

sharbel

12:30 am on Jan 31, 2006 (gmt 0)

10+ Year Member



but I believe further that spiders have preferences to the hyper text markup language than server driven pages

You show your lack of knowledge with that very statement.

SERVER SIDE CODE RETURNS STANDARD HTML (shouting intended). The spiders (or anyone for that matter) never see your server code..

And no, I am not about to 'chill' when you are trying to convince people about things that are wrong. A lot of newbies come here to learn, and I am sorry, but I wont just sit back and watch someone 'educate' them when they have no idea how server-side applications run.

I swear I am not intending this to go into a flame thread, but if you don't even understand how the hell server-side apps return HTML, how are you going to preach that they are somehow not good?

And, just for the record, I have 5 sites ranked #1-3 in google, Yahoo, MSN all done in ASP.NET and all in fairly competitive 'health-related' keywords... :)

abbeyvet

2:08 am on Jan 31, 2006 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



I think you are misinterpreting statements like this from the article you linked:

The content of databases generally will not show up in a search engine result

This, of course, is true - but pages created from content stored in databases WILL show up, just like any other page. If you link to a page, if the user can get to it by following a link and see its content, then so can a search engine, no problem. It doesn't matter whether it is .asp .php .html or .monkey - if the page has a link to it, search engines do not know or care how it was made, they just suck it up.

How much they like your url is an entirely, and I mean entirely, different issue - so they probably won't like or rank a page called http://www.example.com/w9845709345/pwoeih88/9845-2r45.html as much as one called http://www.example.com/widget-type/widget-details.asp

But it has NOTHING to do with the extension, or the fact that the content was fed in from a database. Nothing at all. As so many have already said.

This isn't opinion. It isn't a matter of opinion. It's fact.

kevhead

2:33 pm on Jan 31, 2006 (gmt 0)

10+ Year Member



Most of you are clinging to one fact you hold as truth and ignoring all others.

Yes, it's very possible to get a dynamic site to index well in the search engines. However, it nearly defeats the purpose of having a dyanic, database-driven site, because you have to put so much work into changing page names and variable names to match the content of your pages. I'm working on a system right now that will do that better and faster.

However, Karlos is 100% accurate when he says that SEs, especially Google, are much more apt to FIND, follow and index an HTML page than a .asp, .php, .jsp or other dynamic page, NOT BECAUSE they are dynamic (yes, they parse plain HTML), but because of what they need to pull data (variables attached to the URL to tell the database what to pull).

Like I said, to build a dynamic site to be just as SE-friendly as a static html site, you would have to put so much work in, you might as well just have built it in HTML. What would be the point? That is, until I finish my new content management system! ;-)

abbeyvet

2:46 pm on Jan 31, 2006 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



Like I said, to build a dynamic site to be just as SE-friendly as a static html site, you would have to put so much work in, you might as well just have built it in HTML.

Nonsense. It is a simple matter to have URLs written without queries in them and requires no work at all. You are almost certainly on dynamic sites that do this every day, probably the majority of such sites do.

kevhead

4:24 pm on Jan 31, 2006 (gmt 0)

10+ Year Member



Give me an example then.

sharbel

4:45 pm on Jan 31, 2006 (gmt 0)

10+ Year Member



Kevhead, are you kidding? You seriously want examples of sites that rewrite their URLs to mask the query strings?
This 54 message thread spans 2 pages: 54