소규모 위키 모니터링 팀/IRC

From Meta, a Wikimedia project coordination wiki
This is an archived version of this page, as edited by 리듬 (talk | contribs) at 05:28, 21 October 2021 (Created page with "사용자를 허용 목록에 추가합니다(사용자는 계속 보고되지만 더 이상 '개수'가 제공되지 않음)."). It may differ significantly from the current version.

소규모 위키 모니터링 팀(SWMT, Small Wiki Monitoring Team)에는 리베라 챗 #cvn-sw접속IRC 채널이 있으며 CVNBot다양한 소규모 위키에 대한 문서 훼손 가능성을 보고합니다.

CVNBot

이들은 각각 많은 위키를 모니터링하는 CVNBot(이전에는 SWMTBot으로 알려짐) 인스턴스입니다. 그들은 모두 CVNBot 명령을 이해합니다. 이러한 봇 중 하나에 전역 목록 관련 명령을 실행하면 다른 봇이 자동으로 업데이트됩니다(봇은 숨겨진 채널을 통해 서로 통신함).

자세한 내용은 문서훼손 대응 네트워크의 봇을 참조하세요.

과거에 이러한 봇을 운영해 주신 Az1568, Melos, rxy에게 무한한 감사를 드립니다!

ISObot

요청 시 ISO 639 코드 번역을 수행합니다.

명령

명명 설명 예시
ISObot whatis 코드 제공된 ISO 639 코드에 해당하는 언어의 이름을 영어로 반환합니다(2자 및 3자 코드 모두 허용됨). ISObot whatis tlh

AvicBot

언어 코드 번역 및 편리한 링크 제공

명령

명명 설명 예시 결과
!commands 이 명령 목록을 제공합니다 !commands (this list)
!say AvicBot이 무언가를 말하게 만듭니다 !say Hello Hello!
!lang 주어진 언어 코드에 해당하는 언어의 이름을 영어로 반환 !lang sco? sco is Scots!
!link avicbot.org 짧은 도메인을 사용하여 주어진 페이지에 대한 링크를 반환 !link sco:Main_Page http://avicbot.org/sco:Main_Page
!guc 지정된 사용자에 대한 전역 사용자 기여를 반환 !guc AvicBot https://tools.wmflabs.org/guc/?user=AvicBot&blocks=true
!cauth 지정된 사용자에 대한 중앙공지 페이지를 반환 !cauth AvicBot http://meta.wikimedia.org/wiki/Special:CentralAuth/AvicBot
!die 강제로 AvicBot이 채널을 분리 - 누구나 사용 가능 !die AvicBot -

링크 감시자

#cvn-sw-spam접속#wikimedia-external-links접속에는 '링크 감시자' 봇이 있습니다. 이 채널의 봇은 Beetstra이 실행합니다. 봇은 Shadow1Beetstra이 개발했습니다. Shadow1가 사용 중지되었으므로 봇은 Beetstra에서만 유지 관리됩니다.

데이터는 데이터베이스에 저장됩니다. 특정 정보가 필요하면 Versageek 또는 en:User:Beetstra에게 문의하세요.

봇은 irc.wikimedia.org의 730개 이상의 위키미디어 채널에 대한 10개의 하위 봇과 리베라의 3명의 보고자(LiWa3_1, LiWa3_2 및 LiWa3_3)가 있는 메인 프로그램으로 존재합니다. 많은 하위 봇이 연속 작동(거의 실시간으로 작동)을 보장하고, 하나의 리더 봇에는 읽을 내용이 너무 많고, 하나의 보고서 봇에 대해 보고하기에는 너무 많습니다.

봇은 irc.wikimedia.org에 있는 총 738개의 위키에서 차이점을 읽습니다(특수 사항은 무시하고 sitematrix 참조). 차이가 구문 분석되고 추가된 외부 링크가 봇에 의해 보고됩니다. 특정 위키에서 봇을 원하시면(irc.wikimedia.org에 IRC 채널이 필요합니다), 토론 페이지에서 저에게 연락해 주세요). 위키에 봇이 링크 추가 또는 경고를 보고할 수 있는 자체 채널이 있는 경우 특정 메시지를 특정 채널로 넘겨주기할 수 있습니다(예: #wikipedia-de-spam접속#cvn-wp-es접속 참조).

명령

봇은 데이터베이스를 공유하므로 하나를 명령하면 다른 봇에도 추가됩니다.

명명 설명 예시
LiWa3_1 user add 사용자명 사용자를 허용 목록에 추가합니다(사용자는 계속 보고되지만 더 이상 '개수'가 제공되지 않음). EnLinkWatcher2 user add Beetstra
LiWa3_1 user del Username Removes a user from the whitelist user del Beetstra
LiWa3_1 link wl add regex adds a link to the whitelist link wl add \bexample\.com
LiWa3_1 link wl del regex removes a link from the whitelist link wl del \bexample\.com
LiWa3_1 link rl add regex adds a link to the redlist link rl add \bexample\.com
LiWa3_1 link rl del regex removes a link from the redlist link rl del \bexample\.com
LiWa3_1 link nr add regex adds link to noredir list so as to not further expand link nr add \bexample\.com
LiWa3_1 link nr del regex removes link from noredir list link nr del \bexample\.com
LiWa3_1 link nr search regex searches for link in noredir list link nr search \bexample\.com
!info statistics !info


  • Commands can be issued to one of the other LiWa3 bots as well (LiWa3_2, LiWa3_3, ...). Note that the bot answering may not be the one that is being commanded
  • Whitelist is for 'good' links. Please note that the link-additions by this user are still logged.
  • RedList for links which need attention

Blacklist for bad links (on #wikipedia-en-spam these are transferred to AntiSpamBot and reverted, except when the user is whitelisted).

LiWa3 output

The bot gives one line per added link in the format:

  • [[<wiki>:<pagename>]] <diff link> [[<wiki>:User:<user>]] <added link> (<link data>) <counts> (< for wikidata: English descriptor>)

The link may be followed by link concerns ('<link data>'), and will be followed by a set of 2 or 4 numbers ('<counts>'). On Wikidata, the English descriptor is appended.

The <link data> (which is not always present):

  • Whether the link is on a revert list ('BL'), red/monitor list ('RL') or automatic red list ('ARL', where LiWa3 detected that statistics earlier passed some addition threshold) or white list ('WL' - if showing whitelisted links is turned on)
  • STATUS ###: page status (a.o. the well known '404')
  • REDIR <link>: the site is a redirect site. Redirect target may get a separate set of BL/RL/ARL/WL tags if the redirect target matches rules.
  • Whether the page has tracking data on them:
    • WB (Urchin account number issued by Google)
    • FS (stopForumSpam, a email spam tracking site; the reported percentage is how forumspam evaluates the domain).
    • CB (clickbank)
    • AS (AdSense)
    • PL (Plimus)

The four numbers in <counts>:

  1. how many links did this user add.
  2. how many times was this link added.
  3. how many times did this user add this link. Turns blue when that is higher than 5, turns red when higher than 25. The number may be followed by "!!" (in red), in that case more than 90% of this link was added by this user, "!" in red (more than 66%) or by "?" (more than 33%).
  4. to how many wikipedia did this user add this link. Turns blue when higher than 2, turns red when higher than 5. The number may be followed by "!!" (in red), in that case more than 90% of this link was added interwiki by this user, "!" in red (more than 66%) or by "?" (more than 33%).

Sometimes only two numbers are shown:

  1. how many links did this user add.
  2. how many times was this link added.

That means that either the number of links the user added is larger than the setting largenumberoflinkadditions, or that the number of times the domain was added is larger than the setting largenumberoflinks (see User:LiWa3/Settings). When either reaches that threshold further numerical statistics are generally useless, and are only a strain on the database.

Sometimes the second number is replaced with 'NC', which means that that domain is on the 'do not count list'. That list is used for domains that need to be monitored (including those that are revertlisted) but where the count does not yield any information (e.g. youtube.com, where millions of additions are recorded but where link additions are sometimes reason for concerns, see e.g. en:WP:YOUTUBE).

COIBot

COIBot ('Conflict Of Interest Bot') was originally developed to catch overlaps between usernames with either the page they edit, or with an external link they added. It now also does more sophisticated checks to catch conflicts of interest (as defined on the en-wikipedia, which discourages such edits strongly), and monitors the additions of external links as reported by the LinkWatchers (vide supra).

In short, COIBot reports when:

  • Username overlaps with the pagename the user just edited;
  • Username overlaps with the domain of the external link added;
  • IP of user is close to the IP of the computer the webserver for the domain of the external link is hosted on;
  • IP/IP-range/Username is blacklisted for a certain pagename;
  • IP/IP-range/Username is blacklisted for a certain domain;
  • The domain/IP of domain is on COIBot's monitorlist.

Further:

  • Users/IPs who get caught accidentally, or for who the conflict of interest has been discussed (and been resolved) can be whitelisted against a page or domain.
  • Users/IPs can be whitelisted completely, so none of their edits will be recorded.

Notes:

  • COIBot automatically puts links which get caught because of IP/Username overlap on its monitorlist. So every next addition will be recorded. This 'automonitoring' of the link can be disabled for certain links. After this it will still report the link when COIBot calculates overlap, but will not put the domain on the monitorlist afterwards.
  • COIBot automatically puts links that have been reported to English Wikipedia Spam Project (in the spamlink template), or been blacklisted on one of the local blacklists, or on the meta-blacklist, on its monitorlist. Hence, if a link gets blacklisted on e.g. the Spanish wikipedia, and someone then decides to start adding it to another wikipedia, COIBot will report the additions of that link.

Commands

Most commands on COIBot are 'cloak-controlled'. If you have a cloak, and that cloak is on COIBot's trusted list, you can perform the commands below that are tagged with an asterisk. Everyone can use the other commands. To be added to the trusted-list of COIBot, ask Dirk Beetstra.

CP Command Description Example
Help and test commands
help shows your rights, and which commands are available (quite old) help
help command shows some more help specifically to the command you want to perform (quite old) help wl
test string1 string2 show overlap between the two strings using COIBots overlap calculation test Beetstra example.com
Monitoring commands
* monitor domain domain reason Adds the domain to the monitorlist, matched against the domain of an added external link, and edit is reported when matched (preferred command) monitor domain example.com
* ml del regex Deletes the regex from the monitorlist ml del \bexample\.com
ml search regex searches if the regex is on the monitorlist ml search \bexample\.com
Blacklist commands — for linkwatcher
* bl add username string connects the text string to the username, COIBot reports when overlap string <-> page edited or string <-> domain is detected bl add Beetstra example
* bl del username string Deletes the connection from the blacklist bl del Beetstra example
bl search username string searches if username is connected to string on the blacklist bl search beetstra example
bl search user username searches if username is connected to any text on the blacklist (if so, shows which strings) bl search user beetstra
bl search string string searches if string is connected to any user on the blacklist (if so, shows users) bl search string example
Whitelist commands — for linkwatcher
* wl add username string connects the text string to the username, COIBot ignores when string is pagename edited or string is domain added wl add Beetstra example
* wl del username string Deletes the connection from the whitelist wl del Beetstra example
wl search username string searches if username is connected to string on the whitelist wl search beetstra example
wl search user username searches if username is connected to any text on the whitelist (if so, shows which strings) wl search user beetstra
wl search string string searches if string is connected to any user on the whitelist (if so, shows users) wl search string example
* wl add username * globally whitelists the user, all edits by user will be ignored wl add Beetstra *
* wl add * domain disable automonitoring for domain. COIBot will report conflicts of interest regarding the domain, but the link will not automatically be added to its monitorlist wl add * example.com
Tool commands — consists of commands to access the database created by the LinkWatchers, and other quick-commands giving information on external links and users. The list of commands gets expanded on a regular basis, entering COIBot help in a channel where COIBot is active may give a more up-to-date list.
Report generation commands — reported on wiki
report link domain COIBot creates a report of all additions of domain in its database. Reports are saved to en:Wikipedia:WikiProject Spam/LinkReports and User:COIBot/LinkReports (bottom of list) report link example.com
report ip ip COIBot creates a report of all additions of domains on ip in its database. Reports are saved to en:Wikipedia:WikiProject Spam/LinkReports and User:COIBot/LinkReports (bottom of list) report ip 127.0.0.1
report xwiki domain COIBot creates a report of all additions of domain in its database. Reports are only saved to User:COIBot/LinkReports (bottom of list). This is faster than the 'report link' report xwiki example.com
report local domain COIBOt creates a report for local link additions of domain in its database. Reports are saved to User:COIBot/Local. report local example.com
report user username COIBot creates a report of all edits performed by user username in its database. Reports are saved to en:Wikipedia:WikiProject Spam/UserReports and User:COIBot/UserReports (bottom of list) report user Beetstra
report page pagename COIBot creates a report of all edits performed on page pagename in its database. Reports are saved to en:Wikipedia:WikiProject Spam/PageReports and User:COIBot/PageReports (bottom of list) report user Beetstra
quickcreate domain COIBot creates a XWiki report and it will be saved to User:COIBot/XWiki quickcreate example.com
Query commands — data returned in IRC
resolve domain show the IP of the server the domain is hosted on resolve example.com
whoadded domain Gives the top ten of people who added domain whoadded example.com
whoadded domain id_string Gives the top ten of people who added the domain with the id_string in the full url whoadded youtube.com qxdjdhd563
ipadded domain Gives the top ten of IPs that added domain ipadded example.com
whatadded username Gives the top ten of domains added by user whatadded Beetstra
whenadded link domain Gives the timeframe and top ten of dates when domain was added whenadded link example.com
whenadded user username Gives the timeframe and top ten of dates when user was adding links whenadded user Beetstra
whereadded link domain Gives the top ten of wikis where domain was added whereadded link example.com
whereadded user username Gives the top ten of wikis where user was adding links whereadded user Beetstra
domainsadded domain Gives the top 10 of domains on the server of domain domainsadded example.com
whatserveradded username Gives the top 10 of servers the user added whatserveradded Beetstra
count link domain Gives the number of times the domain was added count link example.com
count user username Gives the number of times user added a link count user Beetstra
research link domain Gives statistics etc. for domain research link example.com
research user username Gives statistics etc. for user (not fully implemented yet) research user Beetstra
convert domain converts the domain/link into a regex convert example.com
top links
top # links
lists the top 5 (or #) links (that are not whitelisted or on the do-not-count list) recorded by COIBot top links
top users
top # users
lists the top 5 (or #) users (who are not whitelisted) recorded by COIBot top links
adsense domain queries AdSense, Web bug, Plimus and Clickbank databases adsense example.com
added adsense adsense_id
added clickbank clickbank_id
queries respective database to find all domains with relevant id added adsense (string)
added clickbank (string)
recent domain
recent # domain
lists recent additions for a domain for past 2 (or #) days recent 2 example.com
Push commands
* do not count domain stops COIBot from counting the number of links to a domain do not count example.com
* clear domain removes domain from redlist and poke, and adds domain to whitelist clear example.com
* read blacklist domain refreshes the blacklist from the specified wiki, useful when fixed and immediate update required read blacklist ms.wikipedia.org
* read whitelist domain refreshes the whitelist from the specified wiki read whitelist ms.wikipedia.org
* read revertlist en.wikipedia.org refreshes the revertlist from en.wikipedia (specific for this wiki only) read revertlist en.wikipedia.org
  • Some commands can be given without directing the command at COIBot (i.e. 'research link example.com' in stead of 'COIBot research link example.com').
  • When a link gets added which is on the monitorlist, gets matched with the username/IP that is adding the link, COIBot also creates a linkreport for that link in in English Wikipedia and in its Meta subpage (bottom of list).

wm-bot

Main article: Wm-bot

wm-bot is an IRC bot which has several commands. The ones everyone can use in #cvn-sw are listed in @info, and include !luxo and !ca. The bot also supports @link [[zh:b:foo]] linking.