In this paper I will give you a brief overview of Sketch Engine’s core functions, tell you details about its development, the people who originally had the idea to create it and about the free corpus-based web tool “SkeLL”.
Sketch Engine is a software for text analysis, database management and corpus management for over 90 languages developed by Lexical Computing Limited and released in 2003. It is a great tool for linguists, lexicographers, translators, students, and teachers to understand how language works. By using Sketch Engine you are able to instantly identify the characteristics of the language, what is rare or unusual and what is increasingly used. This is possible due to its algorithms which can analyse authentical texts with billions of words. The software’s functions enable users to precisely search and filter queries in language corpora and said functions are based on mathematical and statistical computations.
Table of Contents
1. Introduction
2. Development
2.1. Lexical Computing Limited
2.2. Developers: Adam Kilgarriff & Pavel Rychlý
3. Features
3.1. Word Sketch
3.2. Word Sketch Difference
3.3. Thesaurus
3.4. Corpus Building
3.5. Key Word and Term Extraction
3.6. Concordance
3.7. Translation Search
3.8. Frequence Word Lists
4. SkELL
5. Bibliography
Objectives & Topics
The primary objective of this work is to provide a comprehensive overview of the Sketch Engine software, detailing its core functionalities for corpus linguistics and text analysis. The paper examines the development history of the tool, introduces its key architects, and explores its practical applications in modern lexicography and language research.
- The historical development and origin of Sketch Engine by Lexical Computing Limited.
- Technical features including Word Sketches, concordancing, and thesaurus generation.
- Methods for automatic corpus building and the extraction of keywords and terms.
- The role of the SkELL web tool in facilitating language learning through high-quality corpora.
Excerpt from the Book
3.1 Word Sketch
They gather information from millions of examples of use and create a one-page summary of categorised collocations with links to examples. You will know how a word is used by simply looking at this one page.
The word sketch shows you other related words and the collocates of the word. The categorically sorted results are called grammatical relations. They include words which serve as an object of the verb, words which serve as a subject of the verb, words used to modify the word and many more. There are rules written in the sketch grammar that define which words will be included in the analysis.
Summary of Chapters
1. Introduction: This chapter provides an overview of Sketch Engine, highlighting its function as a corpus management and text analysis software used by linguists and dictionary publishers.
2. Development: This section covers the founding of Lexical Computing Limited and profiles the primary developers, Adam Kilgarriff and Pavel Rychlý, detailing their contributions to computational linguistics.
3. Features: This chapter details the technical capabilities of the software, specifically focusing on tools for collocations, synonym generation, corpus building, and frequency analysis.
4. SkELL: This chapter introduces SkELL (Sketch Engine for Language Learning), explaining its purpose as a simplified, free tool for teachers and students to access high-quality example sentences.
5. Bibliography: This section provides a list of academic and technical references used to compile the information regarding the software and its development.
Keywords
Sketch Engine, Corpus Linguistics, Lexicography, Lexical Computing, Word Sketch, Concordance, Text Corpora, Computational Linguistics, Adam Kilgarriff, Language Learning, SkELL, Term Extraction, Natural Language Processing, Statistical Analysis, Data Mining.
Frequently Asked Questions
What is the core purpose of Sketch Engine?
Sketch Engine is a software suite designed for text analysis, database management, and corpus management, serving as a vital tool for linguists, lexicographers, and translators to analyze how languages function.
What are the primary topics covered in this paper?
The paper covers the origins of the software, the contributions of its founders, its central technical features like Word Sketches and concordance searches, and the specialized web tool SkELL.
What is the main research focus of this document?
The document aims to inform the reader about Sketch Engine's core functions and its historical development, providing a structured insight into how it processes billions of words for linguistic research.
Which scientific methods are primarily utilized by the software?
The software relies on mathematical and statistical computations, as well as linguistic algorithms, to process authentic texts and generate grammatical and collocational summaries.
What constitutes the main body of the work?
The main body focuses on the practical application of the software's features, explaining how tools like the Thesaurus, Corpus Building, and Keyword Extraction work in a practical environment.
Which keywords best characterize the work?
Keywords include Sketch Engine, Corpus Linguistics, Lexicography, Word Sketch, Concordance, and Computational Linguistics.
Who was the primary developer of the Word Sketch concept?
The concept of word sketches, which serves as a fundamental feature of the software, was invented by the lexicographer and research scientist Adam Kilgarriff.
What is the difference between Sketch Engine and SkELL?
While Sketch Engine is a comprehensive tool for professionals and researchers, SkELL is a specific, free web-based subset of the technology tailored for language learners and teachers to find high-quality sentence examples.
- Quote paper
- Jennifer Berndt (Author), 2019, Sketch Engine by Adam Kilgarriff. An analysis of its core functions and development, Munich, GRIN Verlag, https://www.grin.com/document/509432