Why is Google indexing this crap when quality sites have sometimes trouble being fully indexed?
Google is not the internet police. It is your responsibility as copyright owner to protect your rights. There is no watermark in text that identifies the real copyright owner to the search engines. The search engines are trying to identify scraped and copied content but there is no 100% guarantee they identify the original source correctly. By the way - who says just because a website was the first to publish a text online it owns the copyright? And how should the search engines know if someone stole your content or has obtained a license for use?
So it is up to you: - Check regularly for copies of your texts. - Check if your content even qualifies for copyright protection. If it exceeds the threshold of originality required by law. - Check if it could be fair use. - Search for contact information and contact the offender. Set a deadline for removal. - If you can't find contact information or you get no responce, file a DMCA complaint with the most important search engines. - If you are sure of your legal position you can also contact the webhost and tell him about the offense.
You should however be aware that if it turns out that your content did not qualify for copyright protection, or the use was fair use after all you could get in serious trouble if the website is taken down.
So better don't contact the webhost or Google about a recipe for boiled eggs.