Forum Moderators: open
Search Engine Theme Pyramids
[searchengineworld.com...]
If the content is worth someone else seeing it, then you need to find someone who thinks it's worth spending the time to categorize. It will need to be very very valuable: Speaking from firsthand experience on two such projects, 30,000 articles is a LOT of categorization!
If the content is gathered from specific offline sources: Serialization is better than no categorization at all. Invite interested parties (if you can identify any) to help populate your categories.
If the content is gathered from the net, then read the copyright laws carefully -- now is better than after the lawyers start sending you mail. Make sure you have a very large mailbox.
If you are generating the content yourself ... no, I can't imagine any human on earth knowing enough to write 30,000 pages, and yet not knowing enough to arrange them in some logical order. That you don't know how to arrange them is the strongest evidence that you really won't be able even to collect anything worth collecting.
Now that structure may be alphabetical, or keyword-indexed, or chronological, or topical. But if you have over 3000 pages, it needs to exhibit hierarchy also.
And if you have a structure, then your links should express the structure.
I don't need categories for this sort of project.
How can you possibly have a "link structure" without categories? Coming up with a link structure is all about conceptualizing your categories from the beginning.
Also, you don't want to distribute your PageRank equally... you want to prioritize, and in fact you will have to, unless you decide to have 30,000 links on each page.
Do you even have any idea how big the file would be if you had 30,000 links on one file?! You'll scare the visitors off in no time. No one would scroll through 30,000 links.
That was precisely my point. I thought of using the winking smiley, but I was being a bit more sarcastic and wasn't quite in the winking mood.
The whole project is about hierarchy and prioritization and the management of data. My posts on the thread below about optimizing a form driven site might possibly relate to this topic too, whether or not it's form driven...
How to make a form driven site searchable
[webmasterworld.com...]
However the other option is to have a simple site map where the site map only has the link to the content and each page of around 150 titles.
So your main page is a major site map going down alphabetically to lower site map pages [150 links on main page by 150 links on second-tier pages = 22,500 links]
So the second level would be the links to all your content pages and under this structure it would be evenly spread out through the 22,500 pages.
So the structure is the main pages of 150 links to your second tier site map pages, then the second tier pages each have 150 links to the content pages. You could try 200 links per page to get to your 30,000 but I think you should set up the basic structure to begin and then as you add pages it will fill in over time until it is complete.
Website will have 5 Categories total., so lets take 1 of the 5 categories for example, something like this:
domain.com/index.html links to:
[domain.com...] (Movies - Horror Category)
Now lets say the entire Horror Category is to contain 5,000 pages. What is the best way to pass down PR to all of the 5,000 pages from the /horror category?
Would something similar to this work well:
[domain.com...]
links to:
Article 1-50 (1.html ----> 50.html) located on /horror/index.html
Article 51-100 (51.html -----> 100.html) located on /horror/page2.html
Article 101-150 (101.html ----> 150.html) located on /horror/page3.html
/horror directory will link out to 50 or so pages (page2.html, page3.html, page4.html each containing 50 articles and link back to the home page root of domain.com)
and so on......now would this be a good method to do for PR flow? would doing something like this work? Not sure if I'm confusing anyone here....thanks for everyone's help!
Article 1-50 (1.html ----> 50.html) located on /horror/index.htmlArticle 51-100 (51.html -----> 100.html) located on /horror/page2.html
Article 101-150 (101.html ----> 150.html) located on /horror/page3.html
Not sure if this is what you mean but it sounds like page1 1 - 50 will link to page 2 51 - 100 which will link to page 3 etc etc. I dont like this structure as it will be difficult to get crawled and not pass PR to the lower pages well.
you need /horror/index.html to give the uses links to all of the page2, 3, 4 etc. even if this is a link called... articles 1-50, articles 51-100.
/horror/index.html will have the following links
Articles 1-50 (Link 1)
Articles 51-100 (Link 2)
Articles 101-150 (Link 3)
and so on......
but is 50 articles per page too much to use or?
Also is it best to link back to domain.com/index.html from each one of those article pages or link back to /horror directory, or maybe do both links back to /horror as well as domain.com/index.html? thanks.
As for link upwards. I like to use what is commonly referred to as footprints or breadcrumbs.
This means that you tell the use where they are in the site for instance in the horror page you will have a link like this
home : horror
in articles 1 - 50 you will have
home : horror : Articles 1-50
and in article 34 you will have
home : horror : Articles 1-50 : Article 34
I find this method to be the most user and search engine friendly
30,000 movies dips deep into "B" grade (probably down through about "W", if your grading scale goes that far. So you probably don't want to break down by director -- or maybe just pages for a dozen or few most active directors.
But you do want a chronological breakdown, and with that many, individual years are appropriate.
Alphabetical breakdown, probably by first two letters of movie title.
Separate indexes per country, (or for each "foreign" country, if most of the movies are from the same country.
Separate indexes for "Series" == Friday the Banal part Thirteenth, etc.
Separate indexes for horror themes -- I dunno, I don't touch the stuff: maybe witchcraft, exorcism, pets gone bad, wild animals, computers, IRS audits, ... on a subject like this, the groupings are defined by the data. (Here's where you can show your knowledge of the subject.)