Skip to content
View SamuelCahyawijaya's full-sized avatar

Highlights

  • Pro

Organizations

@audioku @IndoNLP

Block or report SamuelCahyawijaya

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Pinned Loading

  1. IndoNLP/indonlu IndoNLP/indonlu Public

    The first-ever vast natural language processing benchmark for Indonesian Language. We provide multiple downstream tasks, pre-trained IndoBERT models, and a starter code! (AACL-IJCNLP 2020)

    Jupyter Notebook 566 196

  2. IndoNLP/nusa-crowd IndoNLP/nusa-crowd Public

    A collaborative project to collect datasets in Indonesian languages.

    Jupyter Notebook 263 62

  3. IndoNLP/nusax IndoNLP/nusax Public

    High-quality parallel resource on sentiment analysis for 10 low-resource Indonesian languages, English, and Indonesian (Outstanding Paper at EACL 2023)

    Jupyter Notebook 95 10

  4. IndoNLP/indonlg IndoNLP/indonlg Public

    The first-ever vast natural language generation benchmark for Indonesian, Sundanese, and Javanese. We provide multiple downstream tasks, pre-trained IndoGPT and IndoBART models, and a starter code!…

    Python 71 12

  5. IndoNLP/nusa-writes IndoNLP/nusa-writes Public

    NusaWrites is an in-depth analysis of corpora collection strategy and a comprehensive language modeling benchmark for underrepresented and extremely low-resource Indonesian local languages.

    Jupyter Notebook 24 2

  6. SEACrowd/seacrowd-datahub SEACrowd/seacrowd-datahub Public

    A collaborative project to collect datasets in SEA languages, SEA regions, or SEA cultures.

    Python 70 57