I got hit by the HCU update and I've been trying to diagnose issues on my site.
Digging into Search Console I've noticed a couple of indexing issues.
Chinese/Japanese Internal Spam
My site has been hit with the internal search spam attack, where spammers run long spammy queries through your website's search function.
In my Search Console indexing report, my site has 45K Excluded by 'noindex' tag errors, and the vast majority of them look like this:
Code:
https://mysite.com/?xe=dendv&s=สล็อตเว็บตรง ฝาก-ถอน true wallet ไม่มีขั้นต่ํา(~PG99.Asia~),สล็อตเว็บตรง ฝาก-ถอน true wallet ไม่มีขั้นต่ํา(~PG99.Asia~),สล็อตเว็บตรง ฝาก-ถอน true wallet ไม่มีขั้นต่ําxe8
And about 35K similar results are showing up under my Crawled - currently not indexed report.
Apparently, this attack is common. And all of the advice I've read says it's no big deal as long as they're not indexed.
But Google keeps trying to crawl them and a few of them have even made it into the Indexed report.
Has anyone else dealt with this? WTF do you do when Google indexes stuff you tell them not to index?
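For anyone who wants to count how many of their reported URLs fit this pattern, here is a rough Python sketch for filtering a list of URLs (for example, pasted out of a Search Console export). It assumes the WordPress-style `s` search parameter from the example above. The function name, the 60-character cutoff, and the non-ASCII check are my own guesses, not anything Google documents, and the non-ASCII check will misfire on sites that get legitimate searches in non-Latin scripts.

```python
from urllib.parse import urlsplit, parse_qs

# Assumption: search queries arrive in the "s" parameter, as in the example URL.
SEARCH_PARAM = "s"
# Assumption: real visitors rarely type searches longer than this (arbitrary cutoff).
MAX_QUERY_LEN = 60


def is_spam_search_url(url: str) -> bool:
    """Heuristic: flag URLs whose search term is very long or non-ASCII."""
    params = parse_qs(urlsplit(url).query)
    for term in params.get(SEARCH_PARAM, []):
        if len(term) > MAX_QUERY_LEN or not term.isascii():
            return True
    return False


def filter_spam_urls(urls: list[str]) -> list[str]:
    """Return only the URLs that look like injected search queries."""
    return [u for u in urls if is_spam_search_url(u)]
```

Anything this flags is only a candidate for review, so spot-check the list before acting on it.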
Weird Image Indexing - Combining Different Images from Different Sites
This one is kind of weird. Google is trying to index image URLs on my website that combine my image URL with two different image URLs from COMPLETELY different sites.
It looks like this:
Code:
https://mysite.com/wp-content/uploads/2019/10/example-of-my-imge.jpgΩΩΩhttps://completelydifferentsite1.com/wp-content/uploads/2019/10/a-totally-different-image-url.jpgΩΩΩhttps://completelydifferentsite2.com/wp-content/uploads/2019/10/yet-another-totally-different-image-url.jpg
I had been using Nitro CDN and the only thing I could think was that they were somehow combining images for their different users. But I don't think that could be it because some of the URLs that were being combined with my images were from Amazon and I doubt they're using Nitro CDN.
Anyone else have this? The easiest way to check is to go to Settings in Search Console, open your crawl stats report, and look under Crawl requests: Image.
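If you have a list of crawled image URLs to go through, a small Python sketch like the one below can pick out the combined ones and show which other sites are involved. It assumes the `ΩΩΩ` separator from the example above, and that the URLs may show up percent-encoded (`%CE%A9`) in reports or logs. The function names are just made up for illustration.

```python
from urllib.parse import unquote

# Assumption: combined URLs are glued together with this separator,
# as seen in the example URL above.
SEPARATOR = "ΩΩΩ"


def split_combined_url(url: str) -> list[str]:
    """Decode percent-encoding, then split on the separator."""
    decoded = unquote(url)
    return [part for part in decoded.split(SEPARATOR) if part]


def is_combined_url(url: str) -> bool:
    """True if the URL contains more than one separator-joined part."""
    return len(split_combined_url(url)) > 1
```

Running `split_combined_url` on each flagged URL gives you the individual image URLs, which makes it easier to see whether the foreign domains have anything in common.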