<div style="font-size:120%;font-weight: bold;padding:28px;color: #000000">
Language Perspective of Wikimedia Movement in Taiwan: <br />The Challenge of Taiwanese Taigi and Taiwanese Mandarin
</div>
<!-- Put the link to this slide here so people can follow -->
<div style="font-size:14px;display:flex;background-color:rgb(157 195 145/0.8);padding:32px;justify-content:flex-end;">
<div style="flex-direction:column; text-align:left;color: #000000">
slide: https://hackmd.io/@wikidata-tw/Bk2gFjr2Jl
<!-- .slide: data-background="https://i.imgur.com/cOXo0Yj.jpg" data-background-opacity="0.75"---><br />
Wikimania 2025 <br />
Wikimedia Taiwan Tân Suī-lîm
</div>
</div>
Note:
Hello everybody, this is Dennis Chen, Tân Suī-lîm from Taiwan. Unfortunately, I can't make it here at the Wikimania in Nairobi, Kenya, on site due to I have an important roles in another Event, COSCUP, one of the biggest open source event in Taiwan.
The speech's title "Language Perspective of Wikimedia Movement in Taiwan: The Challenge of Taiwanese Taigi and Taiwanese Mandarin", will cover most of Taiwanese National Language, Taigi and Mandarin. Also I will mention other Southern Sinitic languages like Hakka, Cantonese, Bang-ue, Wu language
---
## Who am I?
<!-- .slide: data-background="https://i.imgur.com/cOXo0Yj.jpg" data-background-opacity="0.3"-->
<div style="font-size:50px;background-color:rgb(230 105 0/0.7);padding:32px;justify-content:flex-end;">
- :flag-tw: Supaplex
- [OpenStreetMap](https://www.openstreetmap.org/user/Supaplex) :heart: [Wikidata](https://www.wikidata.org/wiki/User:Supaplex) :heart:
- [Wikimedia Taiwan](https://meta.wikimedia.org/wiki/Wikimedia_Taiwan/zh) :cat:
</div>
Note:
You might have saw my account on Wikidata and OpenStreetMap. My online ID is Supaplex, mostly active on OpenStreetMap and Wikidata projects. I am also a member of Wikimeida Taiwan and the OpenStreetMap Data Working Group.
---
## Mandarin Wikipedia
<!-- .slide: data-background="https://upload.wikimedia.org/wikipedia/commons/thumb/1/1f/Wikidatacon_2023_Banner_-_04.png/1024px-Wikidatacon_2023_Banner_-_04.png" data-background-opacity="0.3"-->
<div style="background-color:rgb(0 0 0/1)">
* zh-tw, zh-cn, zh-hk
* other: zh-mo, zh-sg
* Conversion Table
</div>
Note:
Mandarin, sometimes has been called Chinese in a very simplification sitation, is the most well-known sinitic language around the world. It is wildly used in Taiwan, China, Singapore. In Southeast Asia countries, like the Philphine, Malaysia as a minority languages.
Hard Working conversion table to deal with simplified Chinese and traditional Chinese characterics conversion, and also a regional terms special conversion set.
---
## Conversion Table
<!-- .slide: data-background="https://upload.wikimedia.org/wikipedia/commons/thumb/1/1f/Wikidatacon_2023_Banner_-_04.png/1024px-Wikidatacon_2023_Banner_-_04.png" data-background-opacity="0.3"-->
<div style="background-color:rgb(255 255 255/1)">

</div>
Note:
Chinese Wikipedia uses a cusomized coversion system to deal with conversion of simplifed <->traditional hanji characterics, and also with regional terms conversion, for example, different published names of movies.
---
## Ideology Differences
<!-- .slide: data-background="https://upload.wikimedia.org/wikipedia/commons/thumb/1/1f/Wikidatacon_2023_Banner_-_04.png/1024px-Wikidatacon_2023_Banner_-_04.png" data-background-opacity="0.3"-->
<div style="background-color:rgb(255 255 255/1)">

</div>
Note:
But conversion tools might not be sutable in every situation. The Massive Recall Movement Wikidata item, some China people have adding descriptions that will cause controversities due to different ideologies. On Wikidata, it seems not possible to have a simple one sentence defination if you included every perspectives, for example, a negetive perspective.
---
## Chinese Idealogy and the Orphan Articles
<!-- .slide: data-background="https://upload.wikimedia.org/wikipedia/commons/thumb/1/1f/Wikidatacon_2023_Banner_-_04.png/1024px-Wikidatacon_2023_Banner_-_04.png" data-background-opacity="0.3"-->
<div style="background-color:rgb(255 255 255/1);display:inline-flex;align-items:center;gap:2rem;">
<div style="flex:1;text-align:left;font-size:100%;left" >

</div>
<div style="flex-direction:column; text-align:left">

</div>
</div>
Note:
Bunch of orphan Wikipedia disambiguation articles exist on Chinese Wikipeida that want to cover both China and Taiwan as the Chinese wing, which might not be agreed amount most Taiwanese folks.
---
## Mandarin Influences of the Southern Sintic Languages
<!-- .slide: data-background="https://upload.wikimedia.org/wikipedia/commons/thumb/1/1f/Wikidatacon_2023_Banner_-_04.png/1024px-Wikidatacon_2023_Banner_-_04.png" data-background-opacity="0.3"-->
<div style="background-color:rgb(0 0 0/0.7)">
* Taiwan: Speak Guoyu Movement、Bring Back My Mother Tounge
* Hong Kong: Biliteracy and Trilingualism
* China: Promotion of Standard Chinese
* Singapore: Speak Mandrin Campaign
</div>
Note:
There are Mandarin promotion champions. In Taiwan, we have the Speak Guoyu Movement, which suppress Taigi, Hakka and other minority sinitic languages, also Taiwan aboriginals languages. 30 years ago, there is a come back from Hakka people, which raise the flag of Bring Back My Mother Tounge movement.
Hong Kong British Colonization government has a much soft way: Biliteracy and Trillingualism, which not effect Cantonese as a education and government language, but using English and hanji for legel.
China and Singapore romotion of Standard Chinese, and Speak Mandarin Campaign , which is much inforced compared to Hong Kong, which suppressed the local and minority languages.
---
## Writing of Taigi on Wikipedia and Wikidata
<!-- .slide: data-background="https://upload.wikimedia.org/wikipedia/commons/thumb/1/1f/Wikidatacon_2023_Banner_-_04.png/1024px-Wikidatacon_2023_Banner_-_04.png" data-background-opacity="0.3"-->
* [Holopedia](https://nan.wikipedia.org/)
* [Wikidata](https://wikidata.org) nan/nan-hani/nan-hant/nan-latn-pehoeji/nan-latn-tailo label, description
* Wikidata [Lexeme](https://www.wikidata.org/wiki/Wikidata:Lexicographical_data) (Wikidata Lexeme)
* [Wiktionary](https://www.wiktionary.org/)
Note:
In the Wikimedia ecosystem, We have Nan Wikipedia, Wikidata, Wikidata Lexeme, and Wiktionary
---
## Introduction of Min Nan/Bân-lâm/Hokkien Language
<!-- .slide: data-background="https://upload.wikimedia.org/wikipedia/commons/thumb/1/1f/Wikidatacon_2023_Banner_-_04.png/1024px-Wikidatacon_2023_Banner_-_04.png" data-background-opacity="0.3"-->
<div>
<div style="background-color:rgb(255 255 255/1)">

</div>
<div style="background-color:rgb(0 0 0/1);font-size:20px">
<a href="https://commons.wikimedia.org/wiki/File:Banlamgu.svg">Luuva</a>, <a href="https://creativecommons.org/licenses/by-sa/3.0">CC BY-SA 3.0</a>, via Wikimedia Commons
</div>
</div>
Note:
the ISO 639-3 language code nan, not only included Taigi, but also China Banlam, Southeast Asia Hokkien, Hailamese, Tiongsan Ban, etc. These languages have different interoperability between each other, and in some extreme cases, it is very hard to communicate to each other, fox example, Hainanese. We will excluded those quite far branches like Hainanese and Teochew language.
---
## The Past of Taiwanese Taigi
<!-- .slide: data-background="https://upload.wikimedia.org/wikipedia/commons/thumb/1/1f/Wikidatacon_2023_Banner_-_04.png/1024px-Wikidatacon_2023_Banner_-_04.png" data-background-opacity="0.3"-->
* One Language Policy
* Mandarin in Education System
* Entertainment: Taigi Songs, Drama
* Market language
Note:
Taigi was harmed by single language policy in Taiwan, discourged in the school systems and the mass media. But Taigi speaking population is quite huge, it is still a vival vernacular language in the marketing and Taigi Entertainmentmarket, for example Taigi songs and dramas.
---
## Recent Taiwan Taigi revitalization
<!-- .slide: data-background="https://upload.wikimedia.org/wikipedia/commons/thumb/1/1f/Wikidatacon_2023_Banner_-_04.png/1024px-Wikidatacon_2023_Banner_-_04.png" data-background-opacity="0.3"-->
* Taiwan Taigi Rectification of Name and Normalization Movement
* Promotion of MOE Recommend Words
* Culture Preserve and Roles of Wiktionary, Holopedia, and Wikidata
Note:
But recent years, the revitalization for the languages in Taiwan, becoming National Languages protected by the law. Taiwanese Taigi is one of the National Languages, and got the resource for standarization: MOE recommend hanji. There are some contributors working on Holopedia, Wiktionary and Wikidata.
---
## Language Usage and Geological Perspect
<!-- .slide: data-background="https://upload.wikimedia.org/wikipedia/commons/thumb/1/1f/Wikidatacon_2023_Banner_-_04.png/1024px-Wikidatacon_2023_Banner_-_04.png" data-background-opacity="0.3"-->
<div style="background-color:rgb(0 0 0/0.7)">
<div>

</div>
<div style="background-color:rgb(0 0 0/0);font-size:20px">
[Source](https://www.facebook.com/staatsoper.wiener.7/posts/pfbid02n4Yguk6TKVgtgrZoiLRTRnUpca1N59FpxyZSDJxGb4ZZbSEiwKHJhDTAe87mjKMql)
</div>
</div>
Note:
A recent arguement in Taiwan that someone from China's Northeast area, don't speak Taigi to him. Whicn cause big controversities on Taiwan internet sphere.
---
## The Most Famous Taiwanese Taigi Speaker
<!-- .slide: data-background="https://upload.wikimedia.org/wikipedia/commons/thumb/1/1f/Wikidatacon_2023_Banner_-_04.png/1024px-Wikidatacon_2023_Banner_-_04.png" data-background-opacity="0.3"-->

source: [CNA](https://www.cna.com.tw/news/ahel/202406160180.aspx)
Note:
Nvidia founder and CEO Jetson Huang is from and a Taiwanese Taigi speaker. When he comes to Taiwan, he will speak Taigi on stage.
---
## 2024 MediaWiki Update: nan-hant, nan-Latn-pehoeji, nan-Latn-tailo support
<!-- .slide: data-background="https://upload.wikimedia.org/wikipedia/commons/thumb/1/1f/Wikidatacon_2023_Banner_-_04.png/1024px-Wikidatacon_2023_Banner_-_04.png" data-background-opacity="0.3"-->

Note:
In 2024, MediaWiki got updated, adding nan-hant support on Wikidata. It is possible using hanji in label field. And also the Romanization systen in wring in Pe̍h-ōe-jī and Tâi-lô.
---
## nan Community
<!-- .slide: data-background="https://upload.wikimedia.org/wikipedia/commons/thumb/1/1f/Wikidatacon_2023_Banner_-_04.png/1024px-Wikidatacon_2023_Banner_-_04.png" data-background-opacity="0.3"-->
* the original Nan Wikipedians
* [Hó-pêng-iú Wikimedians User Group](https://meta.m.wikimedia.org/wiki/Hopengiu_Wikimedians_User_Group)
* [Tâi-uân ê kong-biō-bûn-huà wiki tsuan-àn](https://meta.m.wikimedia.org/wiki/Hopengiu_Wikimedians_User_Group/Wiki_Soan-tho%C3%A2n_Oa%CC%8Dh-t%C4%81ng)
* Wikidata Taiwan community
Note:
Nan Wikipedia started at 2003, just one year after the establish of zh Wikipedia. And nan Wikipedia community choose to use POJ to write articles mainly, hanji as supplement.
Other communtiy included those NCCU aborginal center staffs, which got funded by Wikimedia Foundation for their contribution of nan Wiktictionary
And also the Wikidata Taiwan community
---
## POJ and Hanji: Permanent Duplicated Item
<!-- .slide: data-background="https://upload.wikimedia.org/wikipedia/commons/thumb/1/1f/Wikidatacon_2023_Banner_-_04.png/1024px-Wikidatacon_2023_Banner_-_04.png" data-background-opacity="0.3"-->
* Hakka, Bang-ue both have Hanji and Romanization articles
* Linkage of Same language but different writing system by Wikidata
Note:
There is a henomenon in those southern sinitic languages: huge difference between capital language and local languages, ex: Wu, Bangue(cdo), Hakka, Hokkien. Those languagse like to write in Romanization to distinguish the sound differences. And there are at least two layer of words for the same thing: 文音, 白話音.
For the different wrting system, it is possible to use Wikidata permanant duplicated to link each other
---
## Relation between Permanent Duplicated Items
<!-- .slide: data-background="https://upload.wikimedia.org/wikipedia/commons/thumb/1/1f/Wikidatacon_2023_Banner_-_04.png/1024px-Wikidatacon_2023_Banner_-_04.png" data-background-opacity="0.3"-->
|| Main Item | Permanent Duplicated Item |
|------| -------- | -------- |
|Wikidata| [Q865](https://www.wikidata.org/wiki/Q865) | [Q22337467](https://www.wikidata.org/wiki/Q22337467) |
| nan (zh-min-nan) | [Tiong-hôa-bîn-kok](https://zh-min-nan.wikipedia.org/wiki/Tiong-ho%C3%A2_B%C3%AEn-kok) | [中華民國(台灣)](https://zh-min-nan.wikipedia.org/wiki/Pang-chān:中華民國) |
| hak | [Chûng-fà Mìn-koet](https://hak.wikipedia.org/wiki/Ch%C3%BBng-f%C3%A0_M%C3%ACn-koet) | [中華民國(台灣)](https://hak.wikipedia.org/wiki/中華民國) |
| cdo | [Dṳ̆ng-huà Mìng-guók](https://cdo.wikipedia.org/wiki/D%E1%B9%B3%CC%86ng-hu%C3%A0_M%C3%ACng-gu%C3%B3k) | [中華民國(台灣)](https://cdo.wikipedia.org/wiki/中華民國) |
Note:
Here is Taiwan permanant duplicated item and its label for each language. The same hanji characters but totally different sound by Romanization writing system.
---
## Hard to launch International Collaborion of Hokkien Language
<!-- .slide: data-background="https://upload.wikimedia.org/wikipedia/commons/thumb/1/1f/Wikidatacon_2023_Banner_-_04.png/1024px-Wikidatacon_2023_Banner_-_04.png" data-background-opacity="0.3"-->
* Minority Languages outside Taiwan or China
* Heavily Influence by Mandarin
* Lack of Proper (Digital) Content
Note:
In most countries minority languages. And in Taiwan or China, there are not many writting content and both heavily influenced by Mandarin.
---
## Other Sinitic Languages
<!-- .slide: data-background="https://upload.wikimedia.org/wikipedia/commons/thumb/1/1f/Wikidatacon_2023_Banner_-_04.png/1024px-Wikidatacon_2023_Banner_-_04.png" data-background-opacity="0.3"-->
<div style="display:inline-flex;align-items:center;gap:2rem;">
<div style="flex:1;text-align:left;font-size:100%;left" >
**Written System**
* Chinese characters: [Cantonese](https://en.wikipedia.org/wiki/Yue_Chinese)(yue), [Wu](https://en.wikipedia.org/wiki/Wu_Chinese)(wuu)
* Romanization: [Hakka](https://en.wikipedia.org/wiki/Hakka_Chinese)(hak), [Bàng-uâ](https://en.wikipedia.org/wiki/Eastern_Min)(cdo), [Hokkien](https://en.wikipedia.org/wiki/Southern_Min)(nan)
</div>
<div style="flex:1;text-align:left;font-size:100%;left">
<div style="flex-direction:column; text-align:left">
**Main City**
* Big City: Wu(Shanghai), Nan(Taipei, Kauhsiung), Yue(Hong Kong)
* Music Industrial, Education
</div>
</div>
</div>
Note:
Some summary of the Southern Sinitic Languages practice: Writing system: Yue and wuu use hanji, Hakka, Bangue, Hokkien use romaization system.
And in the school system, both Hong Kong and Taiwan have yue or nan language courses.
---
## Sister's Project: nan name field on OSM
<!-- .slide: data-background="https://upload.wikimedia.org/wikipedia/commons/thumb/1/1f/Wikidatacon_2023_Banner_-_04.png/1024px-Wikidatacon_2023_Banner_-_04.png" data-background-opacity="0.3"-->
<div style="display:inline-flex;align-items:left;">
<div left>

</div>
<div style="font-size:85%;display:flex;background-color:rgb(127 195 140/0.5);padding:1px;justify-content:flex-end;">
<div style="flex-direction:column; text-align:left">
[OpenStreetMap](https://www.openstreetmap.org): Map version of Wikipedia, co-edited by contributers
OpenStreetMap name:nan: [Overpass Query](https://overpass-turbo.eu/s/19i0)
</div>
</div>
</div>
Note:
For the sister project of Wikmedia, there is a co-edit map OpenSteetMap, which is capatible to add multilanguage. Not only English, but also local language like Taiwanese. Here is a screenshot of Wikimedia Map, displaying local place written in POJ.
---
## [Wikimedia Map](https://maps.wikimedia.org/?lang=nan#8/22.482/114.420) by Wikimedia Foundation
<!-- .slide: data-background="https://upload.wikimedia.org/wikipedia/commons/thumb/1/1f/Wikidatacon_2023_Banner_-_04.png/1024px-Wikidatacon_2023_Banner_-_04.png" data-background-opacity="0.3"-->
<div style="display:inline-flex;align-items:left;">
<div left>

</div>
<div style="font-size:85%;display:flex;background-color:rgb(127 195 140/0.5);padding:1px;justify-content:flex-end;">
<div style="flex-direction:column; text-align:left">
Wikimedia Map [label nan POJ](https://maps.wikimedia.org/?lang=nan#7/24.722/117.806)
OpenStreetMap data
[Tâi-bûn hàn-jī](https://maps.wikimedia.org/?lang=nan-Hant#16/24.9725/121.5437), [Tâi-lô](https://maps.wikimedia.org/?lang=nan-Latn-tailo#16/24.9725/121.5437)
</div>
</div>
</div>
Note:
This is the Wikimedia Foundation powered map Wikimedia Map, which used name labels from OpenStreetMap. Wikimedia Map can swtich instanly to any langugae by typing the corresponded language code in the language parameter.
---
## Plan for nan Wikipedia
<!-- .slide: data-background="https://upload.wikimedia.org/wikipedia/commons/thumb/1/1f/Wikidatacon_2023_Banner_-_04.png/1024px-Wikidatacon_2023_Banner_-_04.png" data-background-opacity="0.3"-->
<div style="display:inline-flex;align-items:center;gap:2rem;">
<div style="flex:1;text-align:left;font-size:100%;left" >
* Let's Start Contribution: POJ or Hanji
* Taiwan MOE [Recommend Hanji](https://zh.wikipedia.org/zh-tw/臺灣台語推薦用字) or Koa-á-chheh Characters
</div>
<div style="flex:1;text-align:left;font-size:100%;left">
<div style="flex-direction:column; text-align:left">
* Hanji-POJ linked by Permanent Duplicated Items on Wikidata
* Administrative units
* Science
</div>
</div>
</div>
Note:
Lets write on nan Wikipedia, whatever it is POJ or hanji form.Using Taiwan MOE Recommend Hanji or Koa-á-chheh Hanji
Linking POJ written one to the hanji article by Wikidata
Might be administrative, science articles
---
## Please Join the Writing Campaign
<!-- .slide: data-background="https://upload.wikimedia.org/wikipedia/commons/thumb/1/1f/Wikidatacon_2023_Banner_-_04.png/1024px-Wikidatacon_2023_Banner_-_04.png" data-background-opacity="0.3"-->
* Ignored those claimed that violate linguistics' study; CoC
* The Possible split of nan ISO-639-3 code, might imclubated new Wikipedia Sites
* Edit nan/nan-hant/nan-latn-pehoeji/nan-latn-tailo labels or descriptions on Wikidata
* Change [zh-min-nan.wikipedia](https://zh-min-nan.wikipedia.org/) -> [nan.wikipedia](https://nan.wikipedia.org/) (ISO 639-3)
Note:
Ignore non-lingulist statement.
And we need code of conduct to regulate community discuss. If some extreme speach happened, then we can follow the CoC.
The ISO-693-3 proposal might have a chance for a new language code, which we will have the possible to incubate new Wikipedia site.
Edit Wikidata's label and descritpion for nan/nan-hani is much easy than a whole Wikipedia articles. We should not ignore Wiktonary and Wikidata Lexeme
---
## [To-siā!](https://en.wiktionary.org/wiki/%E5%A4%9A%E8%AC%9D#Chinese) [sṳ̀n-mùng-ǹ!](https://en.wiktionary.org/wiki/%E6%89%BF%E8%92%99%E4%BD%A0) Thank you!
<!-- .slide: data-background="https://upload.wikimedia.org/wikipedia/commons/thumb/1/1f/Wikidatacon_2023_Banner_-_04.png/1024px-Wikidatacon_2023_Banner_-_04.png" data-background-opacity="0.3"-->
:sheep: Q&A:
Communication Channels
- [GitHub](https://github.com/Supaplextw/)、[Wikidata](https://wikidata.org/wiki/User:Supaplex)、[OpenStreetMap](https://www.openstreetmap.org/user/Supaplex)
- [email](mailto:dennis@wikimedia.tw)
- Facebook Group: [Wikidata Taiwan](https://www.facebook.com/groups/2212207218990971/)、[OpenStreetMap Taiwan](https://www.facebook.com/groups/OpenStreetMap.TW/)
Note:
Here is my contact information, thank you.
{"title":"Language Perspective of Wikimedia Movement in Taiwan: The Challenge of Taiwanese Taigi and Taiwanese Mandarin","description":"View the slide with \"Slide Mode\".","contributors":"[{\"id\":\"6d29f5f5-3da6-40f2-b920-e9a4cc2181dd\",\"add\":53156,\"del\":32380,\"latestUpdatedAt\":1754298659104}]"}