Jump to content

Talk:Mix'n'match/Catalogues

Add topic
From Meta, a Wikimedia project coordination wiki
Latest comment: 3 months ago by Stuartyeates in topic ORCID Public data file
The printable version is no longer supported and may have rendering errors. Please update your browser bookmarks and please use the default browser print function instead.

I am changing the page to use my new {{MnM}} template. This lets us do things like set a status, specify a catalog, and generate pre-filled scraper links. I started with the films section, please chime in! --Magnus Manske (talk) 11:28, 6 November 2019 (UTC)Reply

Why did you removed AarhusWiki, Ribewiki and KoldingWiki from the list? --Trade (talk) 15:36, 6 November 2019 (UTC)Reply
It wasn't removed, there was a formatting error and it didn't show, it's corrected now. --Adam Harangozó (talk) 20:08, 28 November 2019 (UTC)Reply

new catalog

I am trying to add new catalog to the list. Where is this described. I have a CSV file with URLs and IDs. From an official source (CMS in US). — The preceding unsigned comment was added by EncycloABC (talk) 15:40, 4 December 2019 (UTC)Reply

Scraping

@Adam Harangozó: Can you scrape Sarvavijnanakosam (Malayalam Encyclopaedia) under Encyclopedias (general)|Encyclopedias (general), it uses Mediawiki. I am not too familiar with the tool. Thanks. Gotitbro (talk) 09:04, 15 January 2020 (UTC)Reply

@Gotitbro: Unfortunately I don't know how to create scrapers either, but maybe @Magnus Manske: or @Gerwoman: can help. --Adam Harangozó (talk) 12:47, 28 January 2020 (UTC)Reply
Hi, the scraper didn't seem to respond for this site, perhaps because this url doesn't work http://web-edition.sarvavijnanakosam.gov.in/Special:AllPages but only this one http://web-edition.sarvavijnanakosam.gov.in/index.php?title=Special:AllPages
Anyway, I tried another wiki (https://simpsonswiki.com/wiki/) and I get warnings from the api :
<br />
<b>Notice</b>:  Trying to get property 'query' of non-object in <b>/data/project/mix-n-match/autoscrape.inc</b> on line <b>322</b><br />
<br />
<b>Notice</b>:  Trying to get property 'allpages' of non-object in <b>/data/project/mix-n-match/autoscrape.inc</b> on line <b>322</b><br />
<br />
<b>Warning</b>:  Invalid argument supplied for foreach() in <b>/data/project/mix-n-match/autoscrape.inc</b> on line <b>322</b><br />
<br />
<b>Notice</b>:  Undefined offset: 0 in <b>/data/project/mix-n-match/autoscrape.inc</b> on line <b>284</b><br />
{"status":"OK","data":{"html":"","log":["0 (AutoScrapeLevelMediaWiki): Reset [{\"mode\":\"mediawiki\",\"url\":\"https:\\\/\\\/simpsonswiki.com\\\/wiki\\\/Special:AllPages\",\"pos\":0,\"apfrom\":\"\"}]","0 (AutoScrapeLevelMediaWiki): Reset [{\"mode\":\"mediawiki\",\"url\":\"https:\\\/\\\/simpsonswiki.com\\\/wiki\\\/Special:AllPages\",\"pos\":0,\"apfrom\":\"\"}]","Found 0 entries."],"results":[],"last_url":""}}
So I think only devs can answer that. Eru (talk) 18:16, 29 January 2020 (UTC)Reply

Connected catalogues

@Magnus Manske: When adding new catalogues, should we use a new column for noting if a site refers to another external ID which could be scraped by the auxiliary matcher? For example [1] lists the GND number at the bottom. Would this help? — The preceding unsigned comment was added by Adam Harangozó (talk) 21:41, 30 January 2020 (UTC)Reply

Can't figure out scraping

Hi, I'd like to add ITIS as a database to Mix'n'match, but I think scraping it is a bit beyond my technical know-how. If anyone has the knowledge and time to do so, that would be great. Thanks, Enwebb (talk) 19:12, 13 May 2020 (UTC)Reply

Statues Vanderkrogt from static to auto-scraped catalog?

Hello fellow Mixers & Matchers! Would someone (TM) - perhaps Jean-Fred? - be able to help with the following? Statues Vanderkrogt is an excellent catalog of public art in MnM. I imported it once statically myself; but it's outdated and has grown quite a bit over the years. Scraping is beyond my own know-how too, and I was wondering if someone could either transform the current one to an auto-scraped one, or delete the current one after creating a new (auto-scraped) edition?

This website does have some peculiarities, as it operates over two domain names; but the identifiers from one domain name (vanderkrogt.net) are universally applicable. Spinster (talk) 13:51, 17 June 2023 (UTC)Reply

NNP

I've tried reading the instructions but there's too much of a learning curve - not just the terminology, but there's too many concepts with which I'm unfamiliar.

Should the Newman Numismatic Portal be incorporated into these lists? Thanks. DS (talk) 20:40, 8 September 2023 (UTC)Reply

ORCID Public data file

@Magnus Manske: I'm surprised that the ORCID public data file at https://support.orcid.org/hc/en-us/articles/360006897394-How-do-I-get-the-public-data-file is not already in the list of catalogues. I'm aware that it's not complete (since subjects can set info to private), but it seems worthwhile. Stuartyeates (talk) 22:39, 17 February 2024 (UTC)Reply