21 relations: Algorithmic information theory, Apache OpenNLP, Apache Tika, Artificial grammar learning, Charset detection, Croatian language, Document classification, Function word, Indonesian language, Kolmogorov complexity, Language analysis for the determination of origin, List of family name affixes, Machine translation, Malay language, N-gram, Native-language identification, Natural language, Natural language processing, Serbian language, Statistics, Translation.
Algorithmic information theory
Algorithmic information theory is a subfield of information theory and computer science that concerns itself with the relationship between computation and information.
New!!: Language identification and Algorithmic information theory · See more »
Apache OpenNLP
The Apache OpenNLP library is a machine learning based toolkit for the processing of natural language text.
New!!: Language identification and Apache OpenNLP · See more »
Apache Tika
Apache Tika is a content detection and analysis framework, written in Java, stewarded at the Apache Software Foundation.
New!!: Language identification and Apache Tika · See more »
Artificial grammar learning
Artificial grammar learning (AGL) is a paradigm of study within cognitive psychology and linguistics.
New!!: Language identification and Artificial grammar learning · See more »
Charset detection
Character encoding detection, charset detection, or code page detection is the process of heuristically guessing the character encoding of a series of bytes that represent text.
New!!: Language identification and Charset detection · See more »
Croatian language
Croatian (hrvatski) is the standardized variety of the Serbo-Croatian language used by Croats, principally in Croatia, Bosnia and Herzegovina, the Serbian province of Vojvodina and other neighboring countries.
New!!: Language identification and Croatian language · See more »
Document classification
Document classification or document categorization is a problem in library science, information science and computer science.
New!!: Language identification and Document classification · See more »
Function word
In linguistics, function words (also called functors) are words that have little lexical meaning or have ambiguous meaning and express grammatical relationships among other words within a sentence, or specify the attitude or mood of the speaker.
New!!: Language identification and Function word · See more »
Indonesian language
Indonesian (bahasa Indonesia) is the official language of Indonesia.
New!!: Language identification and Indonesian language · See more »
Kolmogorov complexity
In algorithmic information theory (a subfield of computer science and mathematics), the Kolmogorov complexity of an object, such as a piece of text, is the length of the shortest computer program (in a predetermined programming language) that produces the object as output.
New!!: Language identification and Kolmogorov complexity · See more »
Language analysis for the determination of origin
Language analysis for the determination of origin (LADO) is an instrument used in asylum cases to determine the national or ethnic origin of the asylum seeker, through an evaluation of their language profile.
New!!: Language identification and Language analysis for the determination of origin · See more »
List of family name affixes
Family name affixes are a clue for surname etymology and can sometimes determine the ethnic origin of a person.
New!!: Language identification and List of family name affixes · See more »
Machine translation
Machine translation, sometimes referred to by the abbreviation MT (not to be confused with computer-aided translation, machine-aided human translation (MAHT) or interactive translation) is a sub-field of computational linguistics that investigates the use of software to translate text or speech from one language to another.
New!!: Language identification and Machine translation · See more »
Malay language
Malay (Bahasa Melayu بهاس ملايو) is a major language of the Austronesian family spoken in Brunei, Indonesia, Malaysia and Singapore.
New!!: Language identification and Malay language · See more »
N-gram
In the fields of computational linguistics and probability, an n-gram is a contiguous sequence of n items from a given sample of text or speech.
New!!: Language identification and N-gram · See more »
Native-language identification
Native-language identification (NLI) is the task of determining an author's native language (L1) based only on their writings in a second language (L2).
New!!: Language identification and Native-language identification · See more »
Natural language
In neuropsychology, linguistics, and the philosophy of language, a natural language or ordinary language is any language that has evolved naturally in humans through use and repetition without conscious planning or premeditation.
New!!: Language identification and Natural language · See more »
Natural language processing
Natural language processing (NLP) is an area of computer science and artificial intelligence concerned with the interactions between computers and human (natural) languages, in particular how to program computers to process and analyze large amounts of natural language data.
New!!: Language identification and Natural language processing · See more »
Serbian language
Serbian (српски / srpski) is the standardized variety of the Serbo-Croatian language mainly used by Serbs.
New!!: Language identification and Serbian language · See more »
Statistics
Statistics is a branch of mathematics dealing with the collection, analysis, interpretation, presentation, and organization of data.
New!!: Language identification and Statistics · See more »
Translation
Translation is the communication of the meaning of a source-language text by means of an equivalent target-language text.
New!!: Language identification and Translation · See more »
Redirects here:
Automatic language identification, Language detection, Language guessing, Language identifying.
References
[1] https://en.wikipedia.org/wiki/Language_identification