Research

Publication year: 2018

Toward a Standardized and More Accurate Indonesian Part-of-Speech Tagging

Written by:
Kemal Kurniawan, Alham Fikri Aji

Abstract

Previous work in Indonesian part-of-speech (POS) tagging is hard to compare because the approaches are not evaluated on a common dataset. Furthermore, despite the success of neural network models for English POS tagging, they are rarely explored for Indonesian. In this paper, we explored various techniques for Indonesian POS tagging, including rule-based, CRF, and neural network-based models. We evaluated our models on the IDN Tagged Corpus. A recurrent neural network achieves a new state-of-the-art F1 score of 97.47. To provide a standard for future work, we publicly release the dataset split that we used.

Other Papers

Publication year: 2021

BERT Goes Brrr: A Venture Towards the Lesser Error in Classifying Medical Self-Reporters on Twitter

Publication year: 2021

IndoCollex: A Testbed for Morphological Transformation of Indonesian Word Colloquialism

Publication year: 2020

Benchmarking Multidomain English-Indonesian Machine Translation
