OCSEAN

@ocsean

Public team

Community (0)
No community contribution yet

Joined on May 7, 2025

  • Date: June 2025 Authors: John Lennon L. Calorio, I Made Sena Darmasetiyawan, Putu Wahyu Widiatmika, I Komang Sumaryana Putra, Dendi Wijaya, Christopher Kinipi, Daniel Lawson Abstract: This technical report documents the activity of the Bristol Computational Linguistics and Data Science workshop, May-June 2025 in Bristol, UK. It comprises of the following activities: Linguistic Data Standardization Linguistic Initial Analysis Linkage
     Like 1 Bookmark
  •  Like  Bookmark
  • 8th May 2025 Kaiping, Steiger, Chousou-Polydouri (2022) Lexedata software Lexedata: A toolbox to edit CLDF lexical datasets, JOSS github readthedocs We should familiarize ourselves with JSON first before using PYTHON We are aiming towards CLDF application that has already worked with the data from Africa In the CLDF, we should try to identify the cognates first in the data We, as linguist, at least able to identify any mistakes with the raw data Try to get used to command line
     Like  Bookmark
  • Wahyu: Edited wordlist files by erasing 1) Brackets, 2) Question mark, 3) English note in certain forms, and 4) Odd forms. Languages: Arta Loloan malay Sabu Abui bunggeta Abui kilakawara Abui mobyetang Agusan manobo
     Like  Bookmark
  •  Like  Bookmark
  • Notes for the files Link to the Google Drive the 1st symbol that we can agree to remove totals=check(od,symbols=['!'],show=True) the the 2nd symbol that we can agree to replace (with e) totals=check(od,symbols=['ẽ'],show=True) the 3rd symbol that we can agree to replace (with u) totals=check(od,symbols=['ù'],show=True)
     Like  Bookmark
  • Reading List Reference material for linguistics Main OCSEAN Reading Group page Kaiping, Steiger, Chousou-Polydouri (2022) Lexedata software Lexedata: A toolbox to edit CLDF lexical datasets, JOSSgithub readthedocs Marian Klamer suggested we look at the work of Natalia Chousou-Polydouri Kaiping and Neureiter 2022 Clocks with bursts: Phylogenetic inference of schismogenesis in language evolution RSOS
     Like 1 Bookmark
  • Installing python Anaconda - a manager for python Reference material for learning coding JGI training page Introduction to Python Introduction to Python 2 Introduction to Data Analysis Matplotlib
     Like  Bookmark
  • OCSEAN linguistic collection, as it is now, also see the table attached: a bit of overview in numbersIncluded to master files were: 22 Indonesia (inlc 4 Abui varieties, 2 Balinese varieties), number of words ranging from 94 to 1226 34 Philippines (incl 2-3? Hiligaynon varieties, 2 Ivatan varieties), number of words ranging from 239 to 1225 4    Philippines (but with problems with lots of words) ​​​​NOT or NOT YET included to the Master file (problems listed in the table) 15 Philippines
     Like  Bookmark