
Cleaning data

Thomas Francart edited this page Jul 3, 2020 · 3 revisions

Cleaning data previously loaded using ISSNbot

The clean command

To clean data, invoke the clean_serials command:

java -Dpassword=xxxxxx -jar issnbot-app-0.9.1-onejar.jar clean_serials

As with the load_data command, the update flag is required for changes to actually be sent to Wikidata; without it, the command only performs a dry run.

Cleaning data by providing a list of QIDs

If you want to clean an explicit list of items, pass the qid=Q123456 parameter, repeating it once per item. For example:

java -Dpassword=xxxxxx -jar issnbot-app-0.9.1-onejar.jar clean_serials qid=Q123 qid=Q456 qid=Q789
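
When the list of items is long, the repeated qid= arguments can be generated rather than typed. The following is a minimal sketch, assuming every identifier is a Q-prefixed Wikidata item ID; the helper name `build_clean_command` is hypothetical and not part of ISSNbot, and the jar name is simply the one used in the command above.

```python
import re

# Jar name taken from the command shown above; adjust it to the version you run.
JAR = "issnbot-app-0.9.1-onejar.jar"
QID_PATTERN = re.compile(r"Q[1-9]\d*")


def build_clean_command(qids, password, jar=JAR):
    """Return the argument list for clean_serials, with one qid= argument per item."""
    args = ["java", f"-Dpassword={password}", "-jar", jar, "clean_serials"]
    for qid in qids:
        qid = qid.strip()
        # Assumption: identifiers are Q-prefixed Wikidata item IDs such as Q123.
        if not QID_PATTERN.fullmatch(qid):
            raise ValueError(f"not a Wikidata QID: {qid!r}")
        args.append(f"qid={qid}")
    return args


if __name__ == "__main__":
    # Prints the command instead of running it, so nothing is sent to Wikidata.
    print(" ".join(build_clean_command(["Q123", "Q456", "Q789"], "xxxxxx")))
```

The returned list can be handed to `subprocess.run` once the printed command looks right.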

Cleaning data when ISSNs have changed in Wikidata

Otherwise, run the command without any parameters. It then cleans every item that has a title (P1476) statement whose reference carries an ISSN value, where that same ISSN value is no longer present on the item in an ISSN (P236) statement.

The exact SPARQL query is available in the ISSNbot source; you may want to execute it before running the clean command so that you know exactly which items will be cleaned.
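
To get a rough preview of the items a parameterless run would select, a query along the following lines can be sent to the Wikidata Query Service. This is only an approximation written from the description above, not the query ISSNbot uses, so the query in the source remains authoritative. The names `CANDIDATES_QUERY` and `find_candidates`, the `LIMIT 100`, and the User-Agent string are assumptions made for illustration.

```python
import json
import urllib.parse
import urllib.request

ENDPOINT = "https://query.wikidata.org/sparql"

# Illustrative approximation only; see the ISSNbot source for the real query.
# Selects items with a title (P1476) statement whose reference holds an ISSN
# (P236) that no longer appears in any ISSN statement on the item itself.
CANDIDATES_QUERY = """
SELECT DISTINCT ?item ?issn WHERE {
  ?item p:P1476 ?titleStatement .
  ?titleStatement prov:wasDerivedFrom ?reference .
  ?reference pr:P236 ?issn .
  FILTER NOT EXISTS { ?item p:P236 / ps:P236 ?issn . }
}
LIMIT 100
"""


def find_candidates(query=CANDIDATES_QUERY, endpoint=ENDPOINT):
    """Run the query and return a list of (item URI, ISSN) pairs."""
    url = endpoint + "?" + urllib.parse.urlencode({"query": query, "format": "json"})
    # The query service asks clients to identify themselves with a User-Agent.
    request = urllib.request.Request(
        url, headers={"User-Agent": "issnbot-clean-preview/0.1 (example)"}
    )
    with urllib.request.urlopen(request, timeout=60) as response:
        bindings = json.load(response)["results"]["bindings"]
    return [(b["item"]["value"], b["issn"]["value"]) for b in bindings]
```

Calling `find_candidates()` performs a network request; comparing its output with a dry run of clean_serials is a reasonable sanity check before sending updates.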

What exactly is cleaned?

  • Title (P1476) with an ISSN reference
  • Language (P407) with an ISSN reference
  • Place of publication (P291) with an ISSN reference
  • Official website (P856) with an ISSN reference
  • ISSN-L (P7363) with an ISSN-L reference

ISSN statements (P236), labels and aliases remain untouched.

By "with an ISSN reference", we mean a reference consisting of two parts:

  • Stated In (P248) = ISSN Register (Q70460099)
  • ISSN (P236) = The ISSN value from which this information was derived
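
The two-part definition above can be expressed as a check on a reference in Wikidata's JSON representation. This is a minimal sketch, not ISSNbot's own code: the function name `is_issn_reference` is hypothetical, the input is assumed to be a reference object as returned by the Wikidata API (a `snaks` dictionary keyed by property ID), and the ISSN in the example is a placeholder.

```python
ISSN_REGISTER = "Q70460099"


def is_issn_reference(reference):
    """Return True if the reference has both parts of an ISSN reference."""
    snaks = reference.get("snaks", {})
    # Part 1: stated in (P248) must point to ISSN Register (Q70460099).
    stated_in = {
        snak["datavalue"]["value"]["id"]
        for snak in snaks.get("P248", [])
        if snak.get("snaktype") == "value"
    }
    # Part 2: an ISSN (P236) value must be present in the same reference.
    issns = [
        snak["datavalue"]["value"]
        for snak in snaks.get("P236", [])
        if snak.get("snaktype") == "value"
    ]
    return ISSN_REGISTER in stated_in and bool(issns)


# Example reference with a placeholder ISSN value.
example = {
    "snaks": {
        "P248": [{
            "snaktype": "value",
            "property": "P248",
            "datavalue": {"type": "wikibase-entityid",
                          "value": {"entity-type": "item", "id": "Q70460099"}},
        }],
        "P236": [{
            "snaktype": "value",
            "property": "P236",
            "datavalue": {"type": "string", "value": "1234-5678"},
        }],
    }
}
print(is_issn_reference(example))
```

A reference that lacks either part, for instance one with only a stated in (P248) snak, would not count as an ISSN reference under this check.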
