A Glossary of Corpus Linguistics (Glossaries in Linguistics) by Paul Baker

By Paul Baker

This is often the 1st complete word list of the numerous professional phrases in corpus linguistics and gives an obtainable consultant for corpus linguists and non-corpus linguists alike.

Show description

Read Online or Download A Glossary of Corpus Linguistics (Glossaries in Linguistics) PDF

Similar language & grammar books

Translating and Interpreting Conflict.

The connection among translation and clash is very suitable in cutting-edge globalised and fragmented international, and this is often attracting elevated educational curiosity. This selection of essays was once encouraged by means of the 1st overseas convention to without delay handle the translator and interpreter s involvement in events of army and ideological clash, and its illustration in fiction.

Intonation Systems: A Survey of Twenty Languages

This is often the 1st accomplished examine of the intonation of other languages of the realm, written by means of a crew of best students within the box, so much of whom are local audio system of the language in query. Surveying twenty languages, the amount introduces a brand new procedure for the multilingual transcription of intonation styles.

Researching Audio Description: New Approaches

Audio description is without doubt one of the many companies on hand to assure accessibility to audiovisual media. It describes and narrates photographs and sounds and ensuing audio is then combined with the unique soundtrack. Audio description is a fancy procedure that touches creation, distribution and reception.

Extra info for A Glossary of Corpus Linguistics (Glossaries in Linguistics)

Example text

Speakers in the corpus may be categorised according to their dialect, and an encoding scheme which distinguishes between variant pronunciations is likely to be employed. ) Dialogue Annotation Tool (DAT) Developed at the Department of Computer Science, University of Rochester, USA, this tool is designed to apply the DAMSL markup scheme to corpus texts. DAMSL (discourse act markup in several layers) allows for the annotation of multiple layers of information relevant to the understanding and analysis of spontaneous conversation.

The following two dispersion plots are both from a small corpus of newsletters produced by a Catholic church. The two words (joy and abortion) have equal frequencies in the corpus, although the dispersion plots show that the term joy is more evenly dispersed throughout the speeches, whereas abortion occurs in fewer files and as a more focussed subject of discussion at various points: Fig. 1. Dispersion plot of joy Fig. 2. Dispersion plot of abortion distinctiveness coefficient Any statistical method of identifying lexical items that are most commonly associated with a particular language variety.

The resulting typed dialogue was recorded and it is these dialogues which make up the Coconut Corpus. The corpus is made up of twenty-four such dialogues, some of which include dialogue annotation. See Di Eugenio et al. (1998) for 48 A GLOSSARY OF CORPUS LINGUISTICS more details of the corpus and annotation scheme. edu/~coconut/coconut-corpus. ) copyright The right to publish and sell literary, musical or artistic work. Corpus compilers need to observe copyright law by ensuring that they seek permission from the relevant copyright holders to include particular texts.

Download PDF sample

Rated 4.37 of 5 – based on 41 votes