I've been thinking a lot lately about duplication across web properties. My main focus right now is getting my technical understanding to a decent level, both of on-page considerations and of GSA SER and how to use it properly.
Because of this I've been thinking about how it's not only duplicate content that's an issue: duplication more broadly is specifically covered in a number of Google's patents, which treat individual web pages as nodes / properties and look at how they are relationally similar to other 'nodes'.
E.g. if money site A is linked to from web pages B and C, what do B and C have in common with each other, what does A have in common with B, and what does A have in common with C?
This is a super simplified way of looking at Google's relevancy factoring and we can also assume their janitors/crawlers are using this kind of logic as well to detect footprints that look like an attempt at manipulating their SERPs. We know they have the patents.
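To make the A / B / C idea concrete, here's a rough sketch of the kind of pairwise comparison I mean. To be clear, this is my own toy illustration and not anything taken from a patent: the attribute labels (CMS, theme, IP, analytics ID), the example values and the function names are all made up, and Jaccard overlap is just one simple way to score "how much do two nodes have in common".

```python
from itertools import combinations


def jaccard(a: set, b: set) -> float:
    """Overlap between two attribute sets: 0.0 = nothing shared, 1.0 = identical."""
    if not a and not b:
        return 0.0
    return len(a & b) / len(a | b)


def pairwise_overlap(pages: dict) -> dict:
    """Score every pair of pages by how many attributes they share."""
    return {
        (x, y): jaccard(pages[x], pages[y])
        for x, y in combinations(sorted(pages), 2)
    }


# Hypothetical attribute sets; every value below is invented for illustration.
pages = {
    "A": {"cms:wordpress", "theme:news-basic", "ip:203.0.113.10", "analytics:id-1"},
    "B": {"cms:wordpress", "theme:news-basic", "ip:203.0.113.10", "analytics:id-2"},
    "C": {"cms:drupal", "theme:custom", "ip:198.51.100.7", "analytics:id-3"},
}

for pair, score in pairwise_overlap(pages).items():
    print(pair, round(score, 2))
```

In this made-up data A and B share three of five distinct attributes, while C shares nothing with either. The point is only that once you list attributes per property, overlaps like a shared IP or theme become trivial to spot.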
I feel like a lot of people are ignoring potential footprints that are easy to identify, and that talk about footprints has become too entwined with talk about PBNs, except of course for things like diversity, duplicate content, anchor text ratios et al.
There are so many more we don't take into consideration. Maybe they're fringe considerations, but surely they're going to become more important in the future? For longer term projects where you're not going down a squeaky clean, minimal manipulation route, I believe this stuff really matters.
I know that there are people here doing spam very smartly, and I'd love to open all of this up as a topic of discussion: not just spam, but how you treat any web property to avoid these issues of duplication, footprints and the like.
- Content
- On-Page Factors
- Ownership (Cookies, IP, Flash Tracking - is this important all the time? How much can Google possibly know from third-party databases?)
- Link Diversity
- Ratios (including property - property ratio footprints, if this matters?)
- Plus anything else you believe, know and think of.
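On the ratios point, this is the sort of quick check I have in mind for anchor text. It's a minimal sketch with invented anchors and a hypothetical helper name, and it says nothing about what ratio is "safe"; it just shows how a profile's distribution can be measured so different properties can be compared.

```python
from collections import Counter


def anchor_ratios(anchors: list) -> dict:
    """Return each anchor text's share of the link profile, most common first."""
    counts = Counter(a.strip().lower() for a in anchors)
    total = sum(counts.values())
    return {text: n / total for text, n in counts.most_common()}


# Invented anchors for illustration only.
anchors = ["Best Widgets", "best widgets", "example.com", "click here"]
print(anchor_ratios(anchors))
```

Anchors are lowercased and trimmed before counting, so "Best Widgets" and "best widgets" count as the same anchor. Whether they should is a judgment call.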
I'm going to continue sinking my teeth into patents and testing in environments where I control as many variables as possible, but it would be great to hear the thoughts of those I consider to be pros.
I welcome anyone to contribute, but I hope this can be a technical discussion, without people parroting what they've heard elsewhere when they lack the understanding or the willingness to explain and back up what they're saying.
- RF