Data Annotation

Kingline Data Centre is experienced in a wide range of labelling work, including:

           ♦          Text labelling

           ♦          Web-page labelling

           ♦          Image labelling

           ♦          Other language labelling work

--------------------------------------------------------------------------------------------------------------------------------------------------------

Text labelling

Drawing on its teams of experienced native-speaker annotators and linguists, Kingline Data Centre undertakes many kinds of script and text work:

Phonetically balanced script generation for TTS corpora
Phonetically balanced script design for ASR corpora
Phonetic lexicon production using SAMPA, X-SAMPA and IPA
Pronunciation lexicons for text-to-speech applications
Variant pronunciation lexicons for automatic speech recognition applications, based on common forms and regional dialects
Other lexicons developed to match transcription data
Text tagging: word segmentation, named entities, grammar, part of speech, semantics, syntax, ontologies, sentiment analysis, opinion mining, etc.
Multilingual dictionary production
Machine translation corpus production
Text production for special domains such as command words, names, SMS, email and chat

Ready-made text corpora are available in many languages for specific categories such as common words, SMS, names and emails; please see the product catalogue.

Web-page labelling

Kingline Data Centre is experienced in providing web-page labelling services. Specially labelled data can be used to train search engines and can significantly improve the accuracy of machine algorithms. Drawing on its experienced labelling teams, which cover various languages and have expertise in many fields, Kingline Data Centre has labelled more than 300 million pages of hundreds of types for its clients, maintaining high quality through its well-developed processes:

Shopping labeling
Concept Hierarchy
Page relevance
Web pages comparing
Words relevance
Advertisement relevance
Classification of query
Phrase comparing
User experience Study
Repeated pattern
Leak recognition
Match labeling
Thread Labeling
Review page labeling
Rating and labeling
Extracting keywords of security
……………….

Image labelling

Kingline Data Centre also provides image data and labelling services for pattern recognition research. After image data is collected, specific features are labelled according to each client's requirements for model training. To date, millions of images have been processed for clients across a wide range of categories:

Frame labelling
Face labelling
Hair labelling
Map labelling
Entity labelling
Handwriting symbol labelling
OCR result proofreading



Other language labelling work

Kingline Data Centre also provides many other kinds of language annotation services, tailored to its clients' specific requirements.