OpenRefine - Wikimedia Commons: integration scenarios
We give an overview of what it should take to build a Commons integration in OpenRefine, starting with a minimalistic scenario (Goal A) and then a more ambitious one (Goal B).
Goal A: structured metadata editing
We make minimal changes to the Wikibase extension to let it edit MediaInfo entities in the same way that it currently edits items.
Overview of the user workflow
Users would start from a table in which one column contains the filenames of Commons media files.
The Title column can be "reconciled" to Commons, which checks that files with those names exist and extracts their Mids (MediaInfo entity identifiers). Then, this column can be used in a Wikibase schema to add metadata on these files. Other columns can be reconciled to Wikidata and used as values in statements on MediaInfo entities.
Development effort
This consists of four steps. Time estimates are for one full-time developer.
With this plan, users should be able to edit structured metadata on existing files in Commons. They should be able to do the same on other instances of MediaWiki which add structured data to their media files (do we know of any?).
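To illustrate the reconciliation step of the workflow above: on Commons, the Mid of a file is the letter "M" followed by the page ID of its file page, so existence checking and Mid extraction can both be derived from a standard MediaWiki `action=query` response. The following is a minimal sketch, not the extension's actual implementation. The function name `mid_from_query_response` and the sample response (including its page ID) are hypothetical, and the response is assumed to use `formatversion=2`, where `pages` is a list and absent pages carry `"missing": true`.

```python
from typing import Optional


def mid_from_query_response(response: dict, title: str) -> Optional[str]:
    """Return the Mid ("M" + page ID) for `title`, or None if the file is missing.

    `response` is assumed to be the decoded JSON of an action=query request
    made with formatversion=2; `title` must already be in normalized form.
    """
    pages = response.get("query", {}).get("pages", [])
    for page in pages:
        if page.get("title") == title and not page.get("missing", False):
            return f"M{page['pageid']}"
    return None


# Hand-written sample response (the page ID is made up for illustration).
sample = {
    "query": {
        "pages": [
            {"pageid": 12345, "ns": 6, "title": "File:Example.jpg"},
            {"ns": 6, "title": "File:Does not exist.jpg", "missing": True},
        ]
    }
}

existing_mid = mid_from_query_response(sample, "File:Example.jpg")
missing_mid = mid_from_query_response(sample, "File:Does not exist.jpg")
```

A cell whose lookup returns `None` would stay unmatched after reconciliation, which is how users would learn that a filename does not exist on Commons.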
Goal B: support for file upload
This builds on Goal A and adds the ability to upload new files to Commons.
Overview of the user workflow
Start with a table where one of the columns contains paths to media files.
The Path column indicates the path of the file to upload. If this field is used in combination with a matched cell in the Title column, then this uploads a new version of the file (a QA warning should be generated to make sure people don't do this unintentionally).
Then, for the upload itself, we have two scenarios (we could support one or both of them):
B1: Direct upload from OpenRefine
Just like the "Performs edits on Wikibase" operation, we can do the file uploads directly from OpenRefine as a long-running operation. Once the operation completes, the reconciled cells in the Title column are all matched to the newly-created Mids.
B2: Export to a batch format which is ingested by another tool
Just like the QuickStatements export currently offered, we could export the batch to a file format which would embed the structured metadata, the wikitext and the media files themselves.
This file could then be uploaded in one go to Commons itself or to an external tool similar to QuickStatements, which would do the upload in the background.
We are not aware of any such file format, but we could design one in consultation with other stakeholders (WMF, batch upload tool maintainers, Wikimedia Commons community).
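Purely to make the idea concrete, here is one possible shape such a batch file could take: a ZIP archive holding the media files plus a JSON manifest with the wikitext and structured metadata for each file. Everything in this sketch is an assumption for illustration only: the archive layout, the `manifest.json` name, the field names and the `build_batch_archive` function are all hypothetical, and the real format would have to come out of the discussion with stakeholders.

```python
import io
import json
import zipfile
from typing import Any, Dict, List


def build_batch_archive(entries: List[Dict[str, Any]]) -> bytes:
    """Pack media files and their metadata into one in-memory ZIP archive.

    Each entry is expected to have the (hypothetical) keys
    'title', 'wikitext', 'statements' and 'media' (raw bytes).
    """
    buffer = io.BytesIO()
    with zipfile.ZipFile(buffer, "w", zipfile.ZIP_DEFLATED) as archive:
        manifest = []
        for index, entry in enumerate(entries):
            media_name = f"media/{index}"
            archive.writestr(media_name, entry["media"])
            manifest.append(
                {
                    "title": entry["title"],
                    "wikitext": entry["wikitext"],
                    "statements": entry["statements"],
                    "media": media_name,  # where the bytes live in the archive
                }
            )
        archive.writestr("manifest.json", json.dumps(manifest, indent=2))
    return buffer.getvalue()


# Illustrative entry; the statement maps a property ID to an item ID.
batch = build_batch_archive(
    [
        {
            "title": "File:Example cat.jpg",
            "wikitext": "== {{int:filedesc}} ==",
            "statements": {"P180": "Q146"},
            "media": b"fake image bytes",
        }
    ]
)
```

Keeping the metadata in a single manifest, separate from the media bytes, would let an ingesting tool validate the whole batch before it starts uploading anything.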
Development effort
This builds on top of Goal A: the steps listed there are prerequisites for this goal.
We need to add new fields to the schema (file path and wikitext) which only make sense for MediaInfo entities; we therefore need proper support for different entity types. It makes sense to develop this support for Wikibase in general anyway (making it easier to add support for editing Properties or Lexemes, for instance).
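As a rough sketch of what the MediaInfo-specific fields and the QA warning from the workflow overview could look like together, the snippet below models one edit with optional file path and wikitext, and flags rows where a path is combined with an already matched file. The class `MediaInfoEdit`, its field names and the function `qa_warnings` are hypothetical; OpenRefine's actual schema classes are written in Java and would look different.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class MediaInfoEdit:
    """One row of the schema, restricted to the fields discussed here."""

    title: str
    mid: Optional[str] = None        # set when the Title cell is matched
    file_path: Optional[str] = None  # set when a file should be uploaded
    wikitext: Optional[str] = None   # wikitext of the file page, if any


def qa_warnings(edits: List[MediaInfoEdit]) -> List[str]:
    """Warn when a path is given for a file that already exists.

    In that case the upload would create a new version of the existing
    file, which the user may not intend.
    """
    warnings = []
    for edit in edits:
        if edit.file_path and edit.mid:
            warnings.append(
                f"{edit.title}: uploading {edit.file_path} would create "
                f"a new version of existing file {edit.mid}"
            )
    return warnings


edits = [
    MediaInfoEdit("File:New.jpg", file_path="/data/new.jpg"),
    MediaInfoEdit("File:Old.jpg", mid="M12345"),
    MediaInfoEdit("File:Both.jpg", mid="M67890", file_path="/data/both.jpg"),
]
found = qa_warnings(edits)
```

Only the third row triggers a warning here: the first is a plain new upload and the second is a metadata-only edit on an existing file.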