71 relations: Academic writing, American National Corpus, Annotation, Bank of English, Bilingual dictionary, British English, British Library, British Library Sound Archive, Brown Corpus, Chambers Harrap, Collocation, Computational linguistics, Conversation, Corpus linguistics, Corpus manager, Corpus of Contemporary American English, Data, Demography, English for specific purposes, English language, English relative clauses, Fiction, Genre, Gerund, Hidden Markov model, Inflection, Intellectual property, International Corpus of English, Lancaster University, Language, Lemma (psycholinguistics), Lexicon, Longman, Markup language, Metadata, Natural language processing, Newspaper, Novel, Orthography, Oxford English Corpus, Oxford University Phonetics Lab, Oxford University Press, Paralanguage, Part of speech, Poetry, Pragmatics, Pronoun, Reference, Semantics, SemEval, ..., Short story, Sketch Engine, Socioeconomics, Speech, Spoken English Corpus, Spoken language, Stereotype, Synchrony and diachrony, Text corpus, Text Encoding Initiative, Translation, University of Oxford, Variety (linguistics), Verb, Vernacular, Word-sense disambiguation, World Englishes, Writing, Written language, Xaira, XML. Expand index (21 more) » « Shrink index
Academic writing is conducted in several sets of forms and genres, normally in an impersonal and dispassionate tone, targeted for a critical and informed audience, based on closely investigated knowledge, and intended to reinforce or challenge concepts or arguments.
The American National Corpus (ANC) is a text corpus of American English containing 22 million words of written and spoken data produced since 1990.
An annotation is a metadatum (e.g. a post, explanation, markup) attached to location or other data.
The Bank of English is a representative subset of the 4.5 billion words COBUILD corpus, a collection of English texts.
A bilingual dictionary or translation dictionary is a specialized dictionary used to translate words or phrases from one language to another.
British English is the standard dialect of English language as spoken and written in the United Kingdom.
The British Library is the national library of the United Kingdom and the largest national library in the world by number of items catalogued.
The British Library Sound Archive (formerly the British Institute of Recorded Sound; also known as the National Sound Archive (NSA)) in London, England is among the largest collections of recorded sound in the world, including music, spoken word and ambient recordings.
The Brown University Standard Corpus of Present-Day American English (or just Brown Corpus) was compiled in the 1960s by Henry Kučera and W. Nelson Francis at Brown University, Providence, Rhode Island as a general corpus (text collection) in the field of corpus linguistics.
Chambers Harrap Publishers (Chambers Harrap Publishers Ltd) is a reference publisher formerly based in Edinburgh, Scotland, which held the property rights of the venerable W.R. Chambers Publishers and its competitor George G. Harrap and Company (founded: 1901).
In corpus linguistics, a collocation is a sequence of words or terms that co-occur more often than would be expected by chance.
Computational linguistics is an interdisciplinary field concerned with the statistical or rule-based modeling of natural language from a computational perspective, as well as the study of appropriate computational approaches to linguistic questions.
Conversation is interactive communication between two or more people.
Corpus linguistics is the study of language as expressed in corpora (bodies) of "real world" text.
A corpus manager (corpus browser or corpus query system) is a tool for multilingual corpus analysis, which allows effective searching in corpora.
The Corpus of Contemporary American English (COCA) is 450-million-word corpus of American English.
Data is a set of values of qualitative or quantitative variables.
Demography (from prefix demo- from Ancient Greek δῆμος dēmos meaning "the people", and -graphy from γράφω graphō, implies "writing, description or measurement") is the statistical study of populations, especially human beings.
English for specific purposes (ESP) is a subset of English as a second or foreign language.
English is a West Germanic language that was first spoken in early medieval England and is now a global lingua franca.
Relative clauses in the English language are formed principally by means of relative pronouns.
Fiction is any story or setting that is derived from imagination—in other words, not based strictly on history or fact.
Genre is any form or type of communication in any mode (written, spoken, digital, artistic, etc.) with socially-agreed upon conventions developed over time.
A gerund (abbreviated) is any of various nonfinite verb forms in various languages, most often, but not exclusively, one that functions as a noun.
Hidden Markov Model (HMM) is a statistical Markov model in which the system being modeled is assumed to be a Markov process with unobserved (i.e. hidden) states.
In grammar, inflection or inflexion – sometimes called accidence – is the modification of a word to express different grammatical categories such as tense, case, voice, aspect, person, number, gender, and mood.
Intellectual property (IP) is a category of property that includes intangible creations of the human intellect, and primarily encompasses copyrights, patents, and trademarks.
The International Corpus of English (ICE) is a set of corpora representing varieties of English from around the world.
Lancaster University, also officially known as the University of Lancaster, is a public research university in the City of Lancaster, Lancashire, England.
Language is a system that consists of the development, acquisition, maintenance and use of complex systems of communication, particularly the human ability to do so; and a language is any specific example of such a system.
In psycholinguistics, a lemma (plural lemmas or lemmata) is an abstract conceptual form of a word that has been mentally selected for utterance in the early stages of speech production.
A lexicon, word-hoard, wordbook, or word-stock is the vocabulary of a person, language, or branch of knowledge (such as nautical or medical).
Longman, commonly known as Pearson Longman, is a publishing company founded in London, England, in 1724 and is owned by Pearson PLC.
In computer text processing, a markup language is a system for annotating a document in a way that is syntactically distinguishable from the text.
Metadata is "data that provides information about other data".
Natural language processing (NLP) is an area of computer science and artificial intelligence concerned with the interactions between computers and human (natural) languages, in particular how to program computers to process and analyze large amounts of natural language data.
A newspaper is a periodical publication containing written information about current events.
A novel is a relatively long work of narrative fiction, normally in prose, which is typically published as a book.
An orthography is a set of conventions for writing a language.
The Oxford English Corpus is a text corpus of 21st century English, used by the makers of the Oxford English Dictionary and by Oxford University Press's language research programme.
The Phonetics Laboratory is the phonetics laboratory at the University of Oxford, England.
Oxford University Press (OUP) is the largest university press in the world, and the second oldest after Cambridge University Press.
Paralanguage is a component of meta-communication that may modify meaning, give nuanced meaning, or convey emotion, such as prosody, pitch, volume, intonation, etc.
In traditional grammar, a part of speech (abbreviated form: PoS or POS) is a category of words (or, more generally, of lexical items) which have similar grammatical properties.
Poetry (the term derives from a variant of the Greek term, poiesis, "making") is a form of literature that uses aesthetic and rhythmic qualities of language—such as phonaesthetics, sound symbolism, and metre—to evoke meanings in addition to, or in place of, the prosaic ostensible meaning.
Pragmatics is a subfield of linguistics and semiotics that studies the ways in which context contributes to meaning.
In linguistics and grammar, a pronoun (abbreviated) is a word that substitutes for a noun or noun phrase.
Reference is a relation between objects in which one object designates, or acts as a means by which to connect to or link to, another object.
Semantics (from σημαντικός sēmantikós, "significant") is the linguistic and philosophical study of meaning, in language, programming languages, formal logics, and semiotics.
SemEval (Semantic Evaluation) is an ongoing series of evaluations of computational semantic analysis systems; it evolved from the Senseval word sense evaluation series.
A short story is a piece of prose fiction that typically can be read in one sitting and focuses on a self-contained incident or series of linked incidents, with the intent of evoking a "single effect" or mood, however there are many exceptions to this.
Sketch Engine is a corpus manager and text analysis software developed by Lexical Computing Limited since 2003.
Socioeconomics (also known as social economics) is the social science that studies how economic activity affects and is shaped by social processes.
Speech is the vocalized form of communication used by humans and some animals, which is based upon the syntactic combination of items drawn from the lexicon.
The Spoken English Corpus (SEC) is a speech corpus used in corpus linguistics consisting of a collection of recordings of spoken British English compiled during the period 1984-7 through a collaboration, funded by IBM, between the Unit for Computer Research on the English Language (UCREL) at the University of Lancaster and the IBM Scientific Centre in Winchester.
A spoken language is a language produced by articulate sounds, as opposed to a written language.
In social psychology, a stereotype is an over-generalized belief about a particular category of people.
Synchrony and diachrony are two different and complementary viewpoints in linguistic analysis.
In linguistics, a corpus (plural corpora) or text corpus is a large and structured set of texts (nowadays usually electronically stored and processed).
The Text Encoding Initiative (TEI) is a text-centric community of practice in the academic field of digital humanities, operating continuously since the 1980s.
Translation is the communication of the meaning of a source-language text by means of an equivalent target-language text.
The University of Oxford (formally The Chancellor Masters and Scholars of the University of Oxford) is a collegiate research university located in Oxford, England.
In sociolinguistics a variety, also called a lect, is a specific form of a language or language cluster.
A verb, from the Latin verbum meaning word, is a word (part of speech) that in syntax conveys an action (bring, read, walk, run, learn), an occurrence (happen, become), or a state of being (be, exist, stand).
A vernacular, or vernacular language, is the language or variety of a language used in everyday life by the common people of a specific population.
In computational linguistics, word-sense disambiguation (WSD) is an open problem of natural language processing and ontology.
World Englishes is a term for emerging localized or indigenized varieties of English, especially varieties that have developed in territories influenced by the United Kingdom or the United States.
Writing is a medium of human communication that represents language and emotion with signs and symbols.
A written language is the representation of a spoken or gestural language by means of a writing system.
Xaira is an XML Aware Indexing and Retrieval Architecture developed at Oxford University.
In computing, Extensible Markup Language (XML) is a markup language that defines a set of rules for encoding documents in a format that is both human-readable and machine-readable.