<div style="display:flex; flex-direction:column;"> <div> </div> <div style="font-size:65%;padding:32px"> # From cross-linking multilingual articles to database entries, how does Wikidata link to the whole world's knowledge - Taiwan Experience </div> <!-- Put the link to this slide here so people can follow --> <div style="font-size:16px;display:flex;background-color:rgb(157 195 145/0.5);padding:32px;justify-content:flex-end;"> <div style="flex-direction:column; text-align:left"> <div>slide:https://hackmd.io/@wikidata-tw/B14060lyn</div> <div>CC-BY-4.0</div> <div>Wikidata Taiwan Community</div> <div>2023/08/17</div> </div> </div> </div> Note: It is quite a honor to talk about Taiwan Wikidata experience. I am Dennis from Wikidata Taiwan, I will talk about the beginning and the current projects in Taiwan --- ## Who am I? - [Supaplex](https://www.openstreetmap.org/user/Supaplex) - [OpenStreetMap](https://www.openstreetmap.org) :heart: [Wikidata](https://www.wikidata.org) :heart: - [Wikimedia Taiwan](https://meta.wikimedia.org/wiki/Wikimedia_Taiwan/zh) :cat: Note: Online ID Supaplex, I'm an active Wikidata and OpenStreetMap projects contributors, and also the board member or Wikimedia Taiwan --- ## Fight Vandalism ![](https://hackmd.io/_uploads/SkuKHuI22.png) <!-- ![](https://hackmd.io/_uploads/SygIHdU3h.png) --> Note: As a community member, I am fight vandalism on Wikidata. --- ## Also Fight Vandalism on OSM ![](https://i.imgur.com/k8tRdw9.png) Note: And fight vandalism on OpenStreetmap too. Sometimes Chinese people are making unrealistic edit, for example, cross Taiwan Strait Railway. --- ## Why should We Pay Attention to Wikidata? * From Wikipedia Articles form to Machine-readable * Cross-language ability * Both International Languages and Regional Languages * A Database of Databases: Indexing other Database's entries, like: OpenStreetMap, School code, River code Note: Wikidata is 10 years old, the structure approach for Wikidata is a key factor. The main feature for Wikidata is machine readable and language-neutrual. Wikidata is a multilingual project, which covered both international languages and regional languages. Wikidata is also a databae of databases which indexing third-party database's entries, like OpenStreetMap, Taiwan School Code, Taiwan River Code --- ## The start of Wikidata in Taiwan - COSCUP 2013 {%youtube dE52LLcUhYk %} Note: In 2013, one of the bigest Open Source conference in Taiwan COSCUP, invited Wikidata Project Manager Lydia from Wikimedia Deutschland to intruce Wikidata. It is one of the earliest talk about Wikidata in Taiwan --- ## The History of Wikidata in Taiwan * 2013: Lydia's Talk in Taipei * 2014~2019: Not active. Scholurs like Tyng-Ruey Chuang are more care about Wikidata * 2019: Community is more fond of Wikidata itmes and schemes: Started OpenStreetMap x Wikidata Monthly Meetup in Taipei * Mass Import of Laws, Villages, Schools, Librarys, Episodes of Dramas, Government Publications, Research Papers Note: 2013 could be consider the start of Wikidata in Taiwan. But during the time of 2014-2019, there is not much Wikidata activies, only scholar like Tyng-Ruey Chuang have some publications or attended Wikidatacon. But in 2019, OpenStreetMap x Wikidata began monthly meetup and start to have discussion about the tagging scheme. And also mass import laws, villages, schools, librarys, episodes of dramas, government publications, research papers etc. --- ## OpenStreetMap x Wikidata ![OpenStreetMap x Wikidata](https://upload.wikimedia.org/wikipedia/commons/c/c5/WdOsm-semanticBridge.jpg) Note: Started in 2019, the recently one is the day before yasterday in Mozspace Taipei --- ## Example - Mozspace Taipei <div style="display:inline-flex;align-items:left;"> <div left> ![](https://hackmd.io/_uploads/Hys8R4_Fn.png) </div> <div style="font-size:85%;display:flex;background-color:rgb(127 195 140/0.5);padding:1px;justify-content:flex-end;"> <div style="flex-direction:column; text-align:left"> &nbsp; * intense of: Venue * Official website: [https://moztw.org/space/](https://moztw.org/space/) * OSM node ID: 5773168030 </div> </div> </div> --- ## Wikidata on Map: OpenSteetMap * Village: type=boundary * River: type=waterway * School: amenity=school * Train, Metro, Light Rail Station: railway=station --- ## [Wikidatacon 2023 @Taiwan ](https://wikidatacon.tw/) ![](https://hackmd.io/_uploads/SktEC-zj3.png) --- ## Difference of Items with Coordinate in Taiwan between 2021 and 2023 ![Taiwan gif](https://addshore.com/wp-content/uploads/2023/07/2021-2023-Taiwan-diff.gif) Link: [Wikidata Map in 2023](https://addshore.com/2023/07/wikidata-map-in-2023/) --- ## Wikidata.org * https://www.wikidata.org * Establish in 2012 October 29 * [Wikipedia](https://en.wikipedia.org/) <-Multimedia-> [Wiki Commons](https://commons.wikimedia.org/wiki/) * Wikipedia <-Structured Data-> Wikidata * Made all Human Intelligent Structured-enable * Until August 2023, there are total [105,877,548 items](https://www.wikidata.org/wiki/Special:Statistics), total [137.8 GB](https://dumps.wikimedia.org/wikidatawiki/20230701/) Note: 一般較 Wikidata 翻譯做維基數據,是 2012 十月成立的,所以今年是十週年。伊運作的方式親像講維基共享資源存多媒體檔案,維基數據是存結構的資料。維基數據是欲存全人類的智識。到2023年七月,計共 1億偌的項目,量是 136.9 GB --- ## Lexeme Data - Taiwanese Taigi <section data-background-iframe="https://www.wikidata.org/wiki/Lexeme:L222612" data-preload data-background-interactive> <h2>Góa/Guá/我 - Wikidata</h2> </section> Note: Wikidata 的新資料形式辭條,會當家己寫字詞典 --- ## Villages in Taiwan * Start import to Wikidata in 2019, the OSM community spent 4 years to semi-import to OpenStreetMap * Simular Case: The Philippines [serv](https://www.openstreetmap/user/serv)'s [Barangay](https://en.wikipedia.org/wiki/Barangay) * Total number: 7,748 * Linked Household Register ID, OSM relation ID and Wikidata QID Note: 其實會來講臺灣村里的故事,是因為菲律賓社群的 Serv,伊會本名叫 Eugene,伊開始處理菲律賓的 Barangay,是菲律賓上小的行政單位。佇 OpenStreetMap 佮 Wikidata 建立村里,攏總開 4 年時間,計共 7,748 的村里處理好勢 (截至 2023 7/1),所有的村里連結戶役政系統代碼佮 Wikidata 編號。 --- ## Visualization of Villages on OSM [![](https://i.imgur.com/fs9Ds83.png)](https://overpass-turbo.eu/s/1kR3) Note: 七千偌接近八千个村里佇咧 OpenStreetMap 的視覺化 --- ## Villages on Wikidata ![](https://i.imgur.com/LphIh9G.jpg) Note: Wikidata Query 顯示的地理分佈 --- ## History of Villages Data on Wikidata * Started in 2019 * The Fudemental Geo Units * Using Open Government Data * Proposed [Housenhold Register Number - P5020](https://www.wikidata.org/wiki/Property:P5020) Note: 臺灣遮的村里資料是對2019年開始,村里是臺灣上基本的政治地理單位(忽略鄰一个通常無GSI範圍的單位)。是根據政府的開放資料來建立,嘛順紲共戶役政代碼提案屬性 P5020 --- ## Some Error * In 2019, Use the Old Dataset Released by the Government(~2018) * Missing the Merged and Newly Established Villages in Tainan City in 2018 * Solution: Use the New Dataset Note: 彼時用到舊的資料,已經無維護的主計處資料集(到2018年1爾爾)。所以台南2018年整併的里。解決的方式是用上新的戶政司的資料集 --- ## Error Again ![](https://i.imgur.com/1pcuj9H.png) Note: zh-min-nan 維基百科那邊有人建立全台北市的里,所以 Wikidata 有建立項目 --- ## Articles of zh-min-nan Wikipedia Established Wikidata Items ![](https://i.imgur.com/1OTBT0a.png) Note: Empty Items due to no one edit after the import --- ## Other Errors * Changhua Wikipedia Articles Workshop: Villages * Chaiyi Wikipedia Artiles Workshop: Villages --- ## Monitor New or Disbanded Villages Tools: [https://wikidata.planetoid.info/?q=已建立鄉鎮條目](https://wikidata.planetoid.info/?q=%已建立鄉鎮條目) ![](https://i.imgur.com/rAxjSeN.png) Note: 社群建立工具來監控對政府的資料集,有新的村里抑是刣去,就愛編輯 Wikidata 佮 OpenStreetMap --- ## Futures of Taiwan Rivers and Creeks * Missing Documents for Creeks * Hard to Edit [OpenStreetMap Relation](https://wiki.openstreetmap.org/wiki/Relation) * The Important of Survey * The Problem of [Ceb Wikipedia](https://en.wikipedia.org/wiki/Cebuano_Wikipedia) Note: 溪仔無資料,佇 OpenStreetMap 編輯溪流關係嘛真困難有難度。欲得著較正確的資料,有時愛實地踏查 --- ## Ceb Wiki - ljsbot [![](https://i.imgur.com/ZU69q1R.png)](https://www.vice.com/en/article/4agamm/the-worlds-second-largest-wikipedia-is-written-almost-entirely-by-one-bot) Note: Cebese Wikipedia 用機器人衝數量,無啥活人,通世界知 --- ### Ceb Wiki - non Correspondence Wikidata Item * A Ceb Wiki Article exists * GNS ID * No link to Wikidata --- ## A list of Rivers or Creeks in Taiwan [![](https://i.imgur.com/InLoWvp.png)](https://overpass-turbo.eu/s/1kR6) Note: 用 Overpass Turbo 會當得著臺灣所有的溪流的清單,有河川代碼佮 Wikidata 的編號 --- ## [Name Suggestion Index](https://wiki.openstreetmap.org/wiki/Name_Suggestion_Index) * Start in October 2013 as a Subsidiary project Under OpenStreetMap iD Editor * Independent Project in 2019: Announced in [State of the Map US 2019](https://2019.stateofthemap.us/program/sat/mapping-brands-with-the-name-suggestion-index.html) * A Pet Side Project with No-one Serious Care Note: Name Suggestion Index 是 2013 年 10 月開始的,附屬 OpenStreetMap 編輯器 iD 經營的,2019 年獨立出來。頭起先 NSI 是無人顧的 side project --- ## Odrinary Stores in Taiwan ![](https://i.imgur.com/oA8GOwf.png) Note: We have an active community to add chain stores information to NSI, powered by Wikidata --- ## Usage of Wikidata: Secondary Tags Using Wikidata on OpenStreetMap ![](https://i.imgur.com/Myfedyj.png) Note: OpenStreetMap 遮的 Wikidata 次級標籤列表 --- ## [subject:wikidata](https://wiki.openstreetmap.org/wiki/Key:subject:wikidata)=Q16574 > subject=蔣中正 (Chiang Kai-Shek) > OSM -> https://www.openstreetmap.org/node/2700264358 ![](https://i.imgur.com/wHu0cVj.jpg) Note: 蔣介石相關的記念物,道路、銅像(tâng-siōng)、各級學校的中正、介壽開頭的 --- ## Etymology - Chiang Kai-Shek Related Stuffs * [name:etymology](https://wiki.openstreetmap.org/wiki/Key:name:etymology)=蔣中正 * [name:etymology:wikidata](https://wiki.openstreetmap.org/wiki/Key:name:etymology:wikidata)=Q16574 * [name:etymology:wikipedia](https://wiki.openstreetmap.org/wiki/Key:name:etymology:wikipedia)=zh:蔣中正 Note: 名號來源,嘛會當透過 Wikidata 加添各地頭的中正路佮蔣介石的關係 --- ## Schools on OSM - [Overpass Query](https://overpass-turbo.eu/s/1mZA) ![](https://i.imgur.com/189UnMU.png) Note: OpenStreetMap 遮有記載的學校的視覺化結果 --- ## Elementary Schools on Wikidata ![](https://i.imgur.com/v52wwKk.png) Note: Wikidata 檢索的結果 --- ## [Wikiproject Taiwan/Schools](https://www.wikidata.org/wiki/Wikidata:WikiProject_Taiwan/Schools) ![](https://i.imgur.com/1YozljD.png) Note: 在地社群有建立協調的頁面,制定欲按怎編輯 --- ## The Challenge of School Data * Mass Imported in 2019, but a Gap in 2019-2022 * Compared School on Wikidata with the Government in 2023 * Add Newly Establish Schools, remove those withdrawed * Read News Articles if there area New or disbanded Schools * Data from other Wikipedia, like Japanese Wikipedia Note: 對 2019 年了後,無啥更新。Wikidata 遐有新設立的學校,是看著新聞去加的,抑是有人佇 OpenStreetMap 加添。除了更新資料以外,另外有日語維基遐重疊,毋過無偌濟。 --- ## What's Next? * Edit imported Agoda and Booking.com hotels' in Taiwan * metadata of Heritage sites * Village: Keep the Data up-to-date * School: Up to date(2023), has to deal with Branch Schhols and Indpentent Classes * River: Ceb Wikipedia Duplicated items * Bus and Railway Fan have aggregated Large amount of Photos, these Categories are mssing Wikidata Link --- ## Conclusion - Data Maintance * The Challenges of Maintainence: Villages and Schools Data * From Mass Import(One Time) to Additional Edits(time comsuming and need careful plan) * OpenStreetMap Wikidata-powered NSI * Use Cases, ex: Visualization --- ## Future Plan * Thematic Workshop * Linking to Different Third-Party Databases * Multilingual: Not only International Big Languages, But Also Taiwan National Languages: Taiwan Taigi, Taiwan Hakka, Taiwan Formosian Languages Note: 未來希望會當舉辦主題工作坊,毋但頭前講的村里、溪流、學校爾爾,各種資料庫的資料整理工課。另外濟語言的部份,不止仔台語的部份,猶閣有 Hak-ka-fa、臺灣原住民的語言 --- ## [To-siā!](https://en.wiktionary.org/wiki/%E5%A4%9A%E8%AC%9D#Chinese) [sṳ̀n-mùng-ǹ!](https://en.wiktionary.org/wiki/%E6%89%BF%E8%92%99%E4%BD%A0) Thank you! :sheep: 你可以在以下管道找到我 <div style="display:inline-flex;align-items:center;gap:2rem;"> <div style="flex:1;text-align:left" left> - [GitHub](https://github.com/Supaplextw/) - Supaplex: [Wikidata](https://wikidata.org/wiki/User:Supaplex),[OpenStreetMap](https://www.openstreetmap.org/user/Supaplex) - or [Email](mailto:dennis@wikimedia.tw) </div> <div style="flex:1;text-align:left" left> * [Wikidata Taiwan Facebook Group](https://www.facebook.com/groups/2212207218990971) * [OpenStreetMap台灣 Facebook Group](https://www.facebook.com/groups/OpenStreetMap.TW) * OSM Wiki [Taiwan](https://wiki.openstreetmap.org/wiki/Taiwan) </div> </div>
{"title":"From cross-linking multilingual articles to database entries, how does Wikidata link to the whole world's knowledge - Ta","description":"View the slide with \"Slide Mode\".","contributors":"[{\"id\":\"6d29f5f5-3da6-40f2-b920-e9a4cc2181dd\",\"add\":18128,\"del\":5308}]"}
    751 views
   Owned this note