I am currently rebuilding and upgrading a large non-profit information website and am keen to add a site search facility.
One problem is that most pages of the site use Bootstrap tabs. All the search facilities that I've looked at provide links to the relevant page, but not with the necessary tab opened. My main menu uses a ?t=x parameter to ensure that the page loads with the correct tab open.
All things considered, I feel it would be best to build and host my own MySQL index of the pages. To do that, I need to build a crawler that can work through a list of specified links and grab just the content of the relevant tab on each page.
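To make the idea concrete, here is a rough sketch of the kind of crawler I have in mind, using only the Python standard library. It rests on assumptions that may not match the real site: that each tab pane is an element with an id of the form "tab" plus the t value (so ?t=2 maps to id="tab2"), that the markup has properly closed tags, and that the names pane_id_for, extract_tab_text, fetch_html and index_pages are just placeholders I made up for illustration.

```python
from html.parser import HTMLParser
from urllib.parse import urlparse, parse_qs
from urllib.request import urlopen

# Elements that never have a closing tag, so they must not affect nesting depth.
VOID_TAGS = {"area", "base", "br", "col", "embed", "hr", "img",
             "input", "link", "meta", "source", "track", "wbr"}


class TabPaneText(HTMLParser):
    """Collect the text found inside the element whose id equals pane_id."""

    def __init__(self, pane_id):
        super().__init__()
        self.pane_id = pane_id
        self.depth = 0      # 0 = outside the pane, >0 = nesting depth inside it
        self.chunks = []

    def handle_starttag(self, tag, attrs):
        if tag in VOID_TAGS:
            return
        if self.depth:
            self.depth += 1
        elif dict(attrs).get("id") == self.pane_id:
            self.depth = 1

    def handle_endtag(self, tag):
        if tag not in VOID_TAGS and self.depth:
            self.depth -= 1

    def handle_data(self, data):
        if self.depth and data.strip():
            self.chunks.append(data.strip())


def pane_id_for(url):
    """ASSUMPTION: ?t=2 corresponds to a pane with id="tab2"; adjust to the real markup."""
    t = parse_qs(urlparse(url).query).get("t", ["1"])[0]
    return f"tab{t}"


def extract_tab_text(html, pane_id):
    """Return the text of one tab pane as a single space-joined string."""
    parser = TabPaneText(pane_id)
    parser.feed(html)
    parser.close()
    return " ".join(parser.chunks)


def fetch_html(url):
    """Download a page and decode it using the charset the server reports."""
    with urlopen(url, timeout=10) as resp:
        charset = resp.headers.get_content_charset() or "utf-8"
        return resp.read().decode(charset, errors="replace")


def index_pages(links, fetch=fetch_html):
    """Return (url, tab_text) rows for a list of links that carry ?t=x."""
    return [(url, extract_tab_text(fetch(url), pane_id_for(url))) for url in links]
```

Each (url, tab_text) row could then be inserted into a MySQL table with a FULLTEXT index, and because the stored URL already carries ?t=x, a search result would link straight to the correct tab. Note that this sketch would also pick up any script or style text inside a pane, and would miscount nesting if tags such as p or li are left unclosed.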
Can anyone please offer some guidance and/or links to code that will help me build this crawler? Thanks in anticipation.