by monologg
Open source
KoBERT is a Korean-specialized language model built on the BERT architecture. It excels at understanding Korean text, with core capabilities including semantic analysis, text classification, and question answering. The model is particularly useful for applications requiring a nuanced grasp of Korean, such as chatbots, sentiment analysis tools, or machine translation systems. What sets it apart is its training on Korean-specific corpora, which lets it outperform generic multilingual models on Korean-language tasks.
To load the KoBERT tokenizer with AutoTokenizer, you must pass trust_remote_code=True.
from transformers import AutoModel, AutoTokenizer

# Load the pretrained KoBERT encoder.
model = AutoModel.from_pretrained("monologg/kobert")
# The custom tokenizer lives in the model repo, so trust_remote_code is required.
tokenizer = AutoTokenizer.from_pretrained("monologg/kobert", trust_remote_code=True)
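Once both are loaded, the tokenizer and model can be chained to produce contextual embeddings. A minimal end-to-end sketch, assuming transformers and torch are installed and the Hugging Face Hub is reachable (the example sentence is arbitrary):

```python
import torch
from transformers import AutoModel, AutoTokenizer

# trust_remote_code=True is needed because the tokenizer code ships with the repo.
tokenizer = AutoTokenizer.from_pretrained("monologg/kobert", trust_remote_code=True)
model = AutoModel.from_pretrained("monologg/kobert")

# Tokenize a Korean sentence ("The weather is nice today") into a batch of size 1.
inputs = tokenizer("오늘 날씨가 좋네요", return_tensors="pt")

# Run the encoder without tracking gradients (inference only).
with torch.no_grad():
    outputs = model(**inputs)

# last_hidden_state holds one 768-dimensional vector per input token.
print(outputs.last_hidden_state.shape)
```

The [CLS] vector, outputs.last_hidden_state[:, 0], is the usual starting point for downstream classification heads.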