Commit c4d34bb

skip article with distorting artificial traffic
1 parent bcf4eec commit c4d34bb

1 file changed

Lines changed: 4 additions & 1 deletion

top/get_data.py

@@ -70,8 +70,11 @@ def is_article(title, wiki_info):
     Is it an article, or some other sort of page? We'll want to filter out the
     search page (Special:Search in English, etc) and similar pages appearing
     inconveniently in the traffic report.
+
+    Skip xhamster. There are a few clues this Wikipedia traffic is artificial.
+    See https://en.wikipedia.org/w/index.php?title=XHamster&diff=701682670&oldid=700826198
     '''
-    skip = ['-', '404.php'] + [wiki_info['mainpage']]
+    skip = ['-', '404.php', 'XHamster'] + [wiki_info['mainpage']]
     prefixes = PREFIXES + wiki_info['namespaces']
     if title in skip:
         return False
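
For readers who want to try the effect of this change in isolation, here is a minimal, self-contained sketch of the skip-list check. It is not the full function from top/get_data.py: the contents of `PREFIXES`, the shape of `wiki_info`, and the prefix check after the skip test are assumptions made for illustration, since the diff shows only part of the function.

```python
# Hypothetical stand-in for the module-level PREFIXES constant; its real
# contents are not visible in this diff.
PREFIXES = ['Special:', 'File:']


def is_article(title, wiki_info):
    """Return False for titles that should be left out of the traffic report."""
    # Exact-match skip list, as in the commit: placeholder titles, the
    # wiki's main page, and the article whose traffic looks artificial.
    skip = ['-', '404.php', 'XHamster'] + [wiki_info['mainpage']]
    prefixes = PREFIXES + wiki_info['namespaces']
    if title in skip:
        return False
    # Assumption: the remainder of the real function rejects titles that
    # start with a known non-article prefix. It is not shown in the diff.
    if any(title.startswith(prefix) for prefix in prefixes):
        return False
    return True


# Illustrative wiki_info; the real dictionary may carry more keys.
wiki_info = {'mainpage': 'Main_Page', 'namespaces': ['Talk:', 'User:']}

for title in ['XHamster', 'Main_Page', 'Special:Search', 'Alan_Turing']:
    print(title, is_article(title, wiki_info))
```

Because `title in skip` is an exact, case-sensitive comparison against list items, only the title `XHamster` as spelled in the list is filtered; a differently cased or redirected title would pass through this check.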
