Integration of OpenStreetMap and Wikidata - the latest examples from Taiwan
slide: https://hackmd.io/@osm-tw/HkWWUOS_Xd
CC-BY-4.0 OpenStreetMap Taiwan Community
OpenStreetMap Taiwan
2025 8
Ta̍k-ke hó, hello everyone. This is Dennis Raylin Chen from Taiwan. I want to talk about cleaning and managing datasets. My talk's title is "The Journal of Importing Open Data Address in Taiwan into OpenStreetMap". I will focus on importing the address dataset.
Who am I?
My online ID is Supaplex. I am a member of the OpenStreetMap Taiwan and Wikidata Taiwan communities, and I currently serve on the board of directors of Wikimedia Taiwan.
Monthly meetup held together with Wikidata Taiwan
Overlapping members on both sides
Monitoring changes and discussing best editing practices
I am one of the co-hosts of the monthly meetup in Taiwan, which is co-hosted with the Wikidata Taiwan community. There is a large overlap of community members between Wikidata and OpenStreetMap in Taiwan. The OpenStreetMap Taiwan community keeps track of major development sites and sometimes discusses tagging schemes for mapping in Taiwan.
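An early collaboration example: tagging villages
Village import
Responsible agency changed from the Directorate-General of Budget, Accounting and Statistics to the Department of Household Registration, Ministry of the Interior
Village mergers and splits
e.g., a large population
There are tools to monitor government announcements
We found that villages sometimes change. What made things worse is that we did not use the newest village dataset for the import. The dataset used to be handled by the Directorate-General of Budget, Accounting and Statistics and was later handed over to the Department of Household Registration. Local governments also sometimes merge or split villages.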
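To illustrate how an outdated import can be compared with a newer village dataset, here is a minimal sketch. It assumes each snapshot is a mapping from village code to name; the codes, names, and the `diff_villages` helper are hypothetical and not taken from the real dataset or the community's tools.

```python
# Hypothetical snapshots: {village_code: village_name}.
# Real codes and names would come from the government village dataset.
imported = {"V001": "Village A", "V002": "Village B", "V003": "Village C"}
latest = {"V001": "Village A", "V003": "Village C (renamed)", "V004": "Village D"}


def diff_villages(old, new):
    """Compare two snapshots and list removed, added, and renamed codes."""
    removed = sorted(set(old) - set(new))   # possibly merged away
    added = sorted(set(new) - set(old))     # possibly created by a split
    renamed = sorted(c for c in set(old) & set(new) if old[c] != new[c])
    return {"removed": removed, "added": added, "renamed": renamed}


changes = diff_villages(imported, latest)
print(changes)
```

A removed code does not say by itself whether a village was merged or simply renumbered, so the output is only a list of candidates to check against the official announcements.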
Network Analysis of administrative units
This is an analysis by a Chinese mapper. There are some strange parent-child relations in Taiwan: for example, an empty township relation with no villages, and a single village that belongs to multiple parent township relations.
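The two anomalies mentioned above can be checked with a small script. This is only a sketch under assumptions: it takes a flat list of (township, village) membership pairs, and the names and the `find_anomalies` helper are hypothetical. It is not the mapper's actual analysis.

```python
from collections import defaultdict

# Hypothetical data: all township relations, and (township, village) memberships.
townships = ["Township X", "Township Y", "Township Z"]
memberships = [
    ("Township X", "Village 1"),
    ("Township X", "Village 2"),
    ("Township Y", "Village 2"),  # Village 2 has two parents
]


def find_anomalies(townships, memberships):
    """Return (townships with no villages, villages with several parents)."""
    children = defaultdict(set)
    parents = defaultdict(set)
    for township, village in memberships:
        children[township].add(village)
        parents[village].add(township)
    empty = sorted(t for t in townships if not children.get(t))
    multi_parent = sorted(v for v, ps in parents.items() if len(ps) > 1)
    return empty, multi_parent


empty_townships, multi_parent_villages = find_anomalies(townships, memberships)
print(empty_townships, multi_parent_villages)
```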
Organizing river and stream data
Wikidata property proposal: river code
Tagging in national languages
Ceb Wiki Lsjbot: mass river import
The Cebuano Wikipedia is largely built from mass bot imports: a bot is used to create articles in bulk. Many river items were also created from the GNS dataset.
River Dataset
Not every river has a river code
Wikidata (ceb) items imported from GNS
The National map from NLSC
Community matching rivers and creeks with Wikidata and River Code
The Taiwanese government spends quite a lot on rivers and assigns river codes to them. However, the list of river codes is quite small compared to the actual number of rivers in Taiwan. We have to add more rivers to both Wikidata and OpenStreetMap, even though these rivers are not in the river code list.
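A rough sketch of the matching step, assuming exact name matching between three sources: the river code list, Wikidata items, and names from the map. All names, codes, and item IDs below are hypothetical placeholders, and real matching would also need to handle name variants and rivers that share a name.

```python
# Hypothetical inputs; the IDs are placeholders, not real Wikidata items.
river_codes = {"River A": "100000", "River B": "200000"}
wikidata_items = {"River A": "Q_RIVER_A", "Creek C": "Q_CREEK_C"}
map_names = ["River A", "River B", "Creek C", "Creek D"]


def match_rivers(names, codes, items):
    """Attach a river code and a Wikidata ID to each name where one exists."""
    rows = []
    for name in names:
        key = name.strip()
        rows.append(
            {"name": key, "river_code": codes.get(key), "wikidata": items.get(key)}
        )
    return rows


matched = match_rivers(map_names, river_codes, wikidata_items)
# Rivers that still need a Wikidata item, and rivers outside the code list.
missing_on_wikidata = [r["name"] for r in matched if r["wikidata"] is None]
without_code = [r["name"] for r in matched if r["river_code"] is None]
print(missing_on_wikidata, without_code)
```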
Characteristics of school data
Metadata includes the school code, address, and names in each language
Wikipedia, Wikidata, Wiki Commons
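Spatial data: OSM has the area (extent)
Spatial data: Wikidata stores latitude and longitude
Elementary schools on Wikidata
Automated tables of school data
Challenges with school data
Few updates since the 2019 import
Adding new schools or handling closed schools by following the news
Merging duplicate entries that come from the Japanese Wikipedia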
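The Name Suggestion Index started in October 2013. It was run as part of the OpenStreetMap editor iD and became an independent project in 2019. At first, NSI was an unattended side project.
Cross-language: use Wikidata for matching
Hoping that tagging on OSM can be standardized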
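The idea of standardizing tags through a shared index can be sketched as follows. This is an illustration only and not how NSI is implemented: the lookup tables, the brand name, the placeholder item ID, and the `suggest_tags` helper are all hypothetical. The `brand` and `brand:wikidata` keys are the OSM keys commonly used for this purpose.

```python
# Hypothetical index: canonical brand -> standardized tags.
CANONICAL = {
    "example mart": {"brand": "Example Mart", "brand:wikidata": "Q_EXAMPLE"},
}
# Hypothetical spelling variants that should map to the same brand.
VARIANTS = {
    "example mart": "example mart",
    "examplemart": "example mart",
    "example-mart": "example mart",
}


def suggest_tags(tags):
    """Return a copy of the tags with standardized brand tags added."""
    key = tags.get("name", "").strip().lower()
    canonical = VARIANTS.get(key)
    merged = dict(tags)
    if canonical is not None:
        merged.update(CANONICAL[canonical])
    return merged


suggested = suggest_tags({"name": "ExampleMart", "shop": "convenience"})
print(suggested)
```

Because the Wikidata ID does not depend on language, the same entry can serve names written in different languages once each variant is listed.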
Common businesses in Taiwan
Bank data for Taiwan in NSI
Bank of Taiwan (台灣銀行) - NSI count: 175
Workflow
Property proposal
Scrape the dataset with a crawler
Clean the data
Import into Wikidata
Add Wikidata links to OpenStreetMap
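The cleaning and linking steps of the workflow above can be sketched like this. It assumes the scraped rows carry a name, a code, and the ID of the Wikidata item created during import. The field names, values, and both helpers are hypothetical; only the `wikidata` key is the real OSM tag used for linking.

```python
# Hypothetical scraped rows; "qid" stands for the item created on Wikidata.
raw_rows = [
    {"name": "  River A ", "code": "100000", "qid": "Q_RIVER_A"},
    {"name": "River A", "code": "100000", "qid": "Q_RIVER_A"},  # duplicate
    {"name": "River B", "code": "", "qid": ""},                 # no code
]


def clean(rows):
    """Trim whitespace, drop rows without a code, and drop duplicate codes."""
    seen = set()
    out = []
    for row in rows:
        rec = {k: v.strip() for k, v in row.items()}
        if not rec["code"] or rec["code"] in seen:
            continue
        seen.add(rec["code"])
        out.append(rec)
    return out


def osm_tags(rec):
    """Build the tags to add on OSM, linking to Wikidata when an ID exists."""
    tags = {"name": rec["name"]}
    if rec["qid"]:
        tags["wikidata"] = rec["qid"]
    return tags


cleaned = clean(raw_rows)
tags = [osm_tags(r) for r in cleaned]
print(tags)
```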
Here is my contact information. To-siā, sṳ̀n-mùng-ǹ! Thank you!