The OpenRefine team is developing a Wikimedia Commons Reconciliation Service.
Eugene Egbe started working as a junior developer (contractor) for Wikimedia-specific features in this project. Eugene is experienced in Structured Data on Commons development, as he was also the developer behind the popular ISA Tool. For OpenRefine, Eugene develops the Wikimedia Commons reconciliation service and a batch upload tool.
First code has been written for the Wikimedia Commons Reconciliation Service. This service will allow OpenRefine (and tools outside of OpenRefine) to take a list of file names from Wikimedia Commons and to convert these file names to their corresponding entity identifiers (“M numbers” or M-ids - the Wikimedia Commons equivalent of Q-ids). Next, it will then be possible to retrieve Wikitext and structured data from these files, so that this can be processed further. Code is available on Gerrit and the service itself will be available at https://commonsreconcile.toolforge.org/
Antonin has ported the EditGroups tool (which is already quite popular on Wikidata) to Wikimedia Commons: https://editgroups-commons.toolforge.org/. This makes it possible for Commons contributors to undo certain batch edits on Wikimedia Commons, including future 'faulty' batch edits by OpenRefine.
Community outreach
There is now a landing page for OpenRefine on Wikimedia Commons: https://commons.wikimedia.org/wiki/Commons:OpenRefine. For now, this page will point to information about the development process. As features are deployed, the page will point to general information and documentation.