IndoLEM is a comprehensive Indonesian NLU benchmark, comprising three pillars NLP task: morpho-syntax, semantic, and discourse. Presented in COLING 2020.
-
Updated
Dec 14, 2020 - Python
IndoLEM is a comprehensive Indonesian NLU benchmark, comprising three pillars NLP task: morpho-syntax, semantic, and discourse. Presented in COLING 2020.
The first large-scale summarization corpus for the Indonesian language. AACL 2020.
indoBERT Base-Uncased fine-tuned on Translated Squad v2.0
DAC Unpad 2021 Final, predicting government sentiment analytics topic of PPKM COVID-19 Policy
A fine tuned IndoBERT model for University Sentiment On Social Media
Model analisis sentimen berbasis IndoBERT yang dapat memprediksi 6 jenis emosi dalam suatu kalimat, yaitu marah, sedih, senang, cinta, takut, dan jijik.
Just an example of how to use indobenchmark transformer (IndoBERT, IndoGPT, IndoBertTweet) in hugging face.
Web-based hoax detection using IndoBert Fine-tuned model
This repository contains the final project (skripsi) for sentiment classification on Indonesian Twitter data using the hashtag #KaburAjaDulu. It explores the performance comparison between a fine-tuned IndoBERT model and traditional machine learning models (such as SVM with IndoBERT embeddings). Built with 🤗 Hugging Face Transformers.
Model analisis sentimen berbasis IndoBERT yang dapat memprediksi sentimen dalam teks berbahasa Jawa Ngoko Lugu
NusaBERT: Teaching IndoBERT to be multilingual and multicultural!
This repository contains a comparative sentiment analysis project on Indonesian YouTube comments related to the Free Nutritious Meal Program (Makan Bergizi Gratis / MBG) using several Deep Learning Architectures.
AI-based detection of gambling promotion using IndoBERT and GPT (zero-shot & few-shot approaches)
IndoBERT is used for sentiment analysis of product reviews, helping businesses understand customer opinions. With fine-tuning, the model improves sentiment classification accuracy, enabling more effective marketing strategies such as ad personalisation, quick response, and service improvement based on customer feedback.
dataset came from scraping in play store livin by mandiri app
🥈🏆 SEPAKAT - Modul Integrasi is a winning project in Regsosek Hackathon 2022 organized by The Ministry of National Development Planning/Bappenas Indonesia. This module provides a single individual identification model by integrating Regsosek data as basic information which is then linked with related data using the idea of entity resolution.
This project focuses on the classification of Indonesian news headlines into clickbait and non-clickbait categories using Natural Language Processing (NLP) techniques. The study combines traditional Machine Learning approaches and state-of-the-art deep learning models to analyze linguistic patterns commonly found in clickbait headlines.
Gojek reviews sentiment analysis using IndoELECTRA and IndoBERT with full NLP pipeline (preprocessing, labeling, training, evaluation).
Add a description, image, and links to the indobert topic page so that developers can more easily learn about it.
To associate your repository with the indobert topic, visit your repo's landing page and select "manage topics."