Mathematical linguistics
Example Applications of Mathematical Linguistics
Mathematical linguistics is the application of mathematics to model phenomena and solve problems in general linguistics and theoretical linguistics. Mathematical linguistics has a significant amount of overlap with computational linguistics. Discrete MathematicsDiscrete mathematics is used in language modeling, including formal grammars, language representation, and historical linguistic trends. Set TheorySemantic classes, word classes, natural classes, and the allophonic variations of each phoneme in a language are all examples of applied set theory. Set theory and concatenation theory are used extensively in phonetics and phonology. CombinatoricsIn phonotactics, combinatorics is useful for determining which sequences of phonemes are permissible in a given language, and for calculating the total number of possible syllables or words, based on a given set of phonological constraints. Combinatorics on words can reveal patterns within words, morphemes, and sentences. Finite-State TransducersContext-sensitive rewriting rules of the form a → b / c _ d, used in linguistics to model phonological rules and sound change, are computationally equivalent to finite-state transducers, provided that application is nonrecursive, i.e. the rule is not allowed to rewrite the same substring twice.[1] Weighted FSTs found applications in natural language processing, including machine translation, and in machine learning.[2][3] An implementation for part-of-speech tagging can be found as one component of the OpenGrm[4] library. AlgorithmsOptimality theory (OT) and maximum entropy (Maxent) phonotactics use algorithmic approaches when evaluating candidate forms (phoneme strings) for determining the phonotactic constraints of a language.[5] Graph TheoryTrees have several applications in linguistics, including: Other graphs that are used in linguistics include:
Formal linguisticsFormal linguistics is the branch of linguistics which uses formal languages, formal grammars and first-order logical expressions for the analysis of natural languages. Since the 1980s, the term is often used to refer to Chomskyan linguistics.[6] LogicLogic is used to model syntax, formal semantics, and pragmatics. Modal logic can model syntax that employs different grammatical moods.[7] Most linguistic universals (e.g. Greenberg's linguistic universals) employ propositional logic. Lexical relations between words can be determined based on whether a pair of words satisfies conditional propositions.[8]
SemioticsMethods of formal linguistics were introduced by semioticians such as Charles Sanders Peirce and Louis Hjelmslev. Building on the work of David Hilbert and Rudolf Carnap, Hjelmslev proposed the use of formal grammars to analyse, generate and explain language in his 1943 book Prolegomena to a Theory of Language.[9][10] In this view, language is regarded as arising from a mathematical relationship between meaning and form. The formal description of language was further developed by linguists including J. R. Firth and Simon Dik, giving rise to modern grammatical frameworks such as systemic functional linguistics and functional discourse grammar. Computational methods have been developed by the framework functional generative description among others. Dependency grammar, created by French structuralist Lucien Tesnière,[11] has been used widely in natural language processing. Differential Equations & Multivariate CalculusThe Fast Fourier Transform, Kalman filters, and autoencoding are all used in signal processing (advanced phonetics, speech recognition). StatisticsIn linguistics, statistical methods are necessary to describe and validate research results, as well as to understand observations and trends within an area of study. Corpus statisticsStudent's t-test can be used to determine whether the occurrence of a collocation in a corpus is statistically significant.[12] For a bigram , let be the unconditional probability of occurrence of in a corpus with size , and let be the unconditional probability of occurrence of in the corpus. The t-score for the bigram is calculated as: where is the sample mean of the occurrence of , is the number of occurrences of , is the probability of under the null-hypothesis that and appear independently in the text, and is the sample variance. With a large , the t-test is equivalent to a Z-test. LexicostatisticsLexicostatistics can model the lexical similarities between languages that share a language family, sprachbund, language contact, or other historical connections. Quantitative linguisticsQuantitative linguistics (QL) deals with language learning, language change, and application as well as structure of natural languages. QL investigates languages using statistical methods; its most demanding objective is the formulation of language laws and, ultimately, of a general theory of language in the sense of a set of interrelated languages laws.[13] Synergetic linguistics was from its very beginning specifically designed for this purpose.[14] QL is empirically based on the results of language statistics, a field which can be interpreted as statistics of languages or as statistics of any linguistic object. This field is not necessarily connected to substantial theoretical ambitions. Corpus linguistics and computational linguistics are other fields which contribute important empirical evidence. Quantitative comparative linguisticsQuantitative comparative linguistics is a subfield of quantitative linguistics which applies quantitative analysis to comparative linguistics. It makes use of lexicostatistics and glottochronology, and the borrowing of phylogenetics from biology. See AlsoReferences
Bibliography
|
Portal di Ensiklopedia Dunia