Research talk:Are the bots really fighting/Work log/2017-03-24

From Meta, a Wikimedia project coordination wiki

Friday, March 24, 2017[edit]

Updated the interwiki part of the comment parser in this Jupyter notebook, so that it looks for language codes bordered by two punctuation marks from [](){},: or one punctuation mark and one space. Beginning and end of a comment string counts as a space, not a punctuation mark. This appears to catch almost all of the comments that were suspected, leaving only 3.32% of the comments uncategorized across all namespaces and 0.99% uncategorized in ns0. Staeiou (talk) 21:04, 24 March 2017 (UTC)[reply]