Concept information
Preferred term
language modelling
Definition
- the construction of statistical or Machine Learning language models
Broader concept
Narrower concepts
Entry terms
- language modeling
- LM
Note
- A Statistical Language Model predicts a word given a sequence of already known words (i.e. the history). Ist can also be applied to other sequences of symbols (e.g. DNA). Very often the history contains just the previous two words. This is called a trigram. The parameters of statistical language models are estimated from a set of training examples. Data sparsity and smoothing of the estimates is one of the core problems. The best smoothing technique known so far is Kneser-Ney-Smoothing. Maximum-Entropy techniques are also under investigation and may be the method of choice for long-range language models (beyond trigram). Language models are used in text-compression, speech recognition, information retrieval and information extraction.
Source
- https://www.di.uoa.gr/sites/default/files/documents/grad/M908-YPOLOGISTIKH_GLVSSOLOGIA_NLP_DIAXEIRISH_GLWSSIKWN_PORWN.pdf
In other languages
-
Greek
URI
http://w3id.org/clarin_el_dictionary/languageModelling
{{label}}
{{#each values }} {{! loop through ConceptPropertyValue objects }}
{{#if prefLabel }}
{{/if}}
{{/each}}
{{#if notation }}{{ notation }} {{/if}}{{ prefLabel }}
{{#ifDifferentLabelLang lang }} ({{ lang }}){{/ifDifferentLabelLang}}
{{#if vocabName }}
{{ vocabName }}
{{/if}}