Sidorov | Syntactic n-grams in Computational Linguistics | Book | 978-3-030-14770-9 | sack.de

Book, English, 92 pages, format (W × H): 155 mm × 235 mm, weight: 172 g

Series: SpringerBriefs in Computer Science

Sidorov

Syntactic n-grams in Computational Linguistics


1st edition 2019
ISBN: 978-3-030-14770-9
Publisher: Springer International Publishing


This book presents a new approach in computational linguistics based on constructing n-grams in a non-linear manner. The traditional approach, by contrast, uses data from the surface structure of texts, i.e., their linear structure.

In this book, we propose and systematize the concept of syntactic n-grams, which makes it possible to use syntactic information in automatic text processing methods for classification or clustering. It is an interesting example of applying linguistic information in automatic (computational) methods. Roughly speaking, the suggestion is to follow syntactic trees and construct n-grams based on paths in these trees. There are several types of non-linear n-grams; future work should determine which types are more useful in which natural language processing (NLP) tasks.
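The contrast between linear n-grams and n-grams built from tree paths can be illustrated with a minimal Python sketch. It is not taken from the book: the function names `linear_ngrams` and `syntactic_ngrams`, the example sentence, and its hand-written dependency parse are all assumptions made for illustration, and the sketch covers only simple head-to-dependent paths, which may not match the book's exact definitions of the different n-gram types.

```python
from typing import List, Tuple


def linear_ngrams(tokens: List[str], n: int) -> List[Tuple[str, ...]]:
    """Traditional n-grams: n neighbouring words in surface order."""
    return [tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1)]


def syntactic_ngrams(tokens: List[str], heads: List[int], n: int) -> List[Tuple[str, ...]]:
    """n-grams read along head-to-dependent paths of a dependency tree.

    heads[i] is the index of the head of token i, or -1 for the root.
    (Hypothetical encoding chosen for this sketch.)
    """
    # Build a child list for every token from the head indices.
    children = {i: [] for i in range(len(tokens))}
    for dep, head in enumerate(heads):
        if head >= 0:
            children[head].append(dep)

    def paths_from(node: int, length: int) -> List[List[int]]:
        # All downward paths that start at `node` and contain `length` nodes.
        if length == 1:
            return [[node]]
        return [[node] + rest
                for child in children[node]
                for rest in paths_from(child, length - 1)]

    return [tuple(tokens[i] for i in path)
            for start in range(len(tokens))
            for path in paths_from(start, n)]


# Assumed example: "the dog chased a cat" with a hand-written parse.
# chased is the root; dog and cat depend on chased; the -> dog; a -> cat.
tokens = ["the", "dog", "chased", "a", "cat"]
heads = [1, 2, -1, 4, 2]

print(linear_ngrams(tokens, 2))
print(syntactic_ngrams(tokens, heads, 2))
print(syntactic_ngrams(tokens, heads, 3))
```

In this toy parse, the pair ("chased", "cat") appears among the tree-based bigrams although the two words are not adjacent in the sentence, which is the kind of information a linear n-gram of the same size cannot capture.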

This book is intended for specialists in the field of computational linguistics. However, we have made an effort to explain clearly how to use n-grams and provide a large number of examples, so we believe the book is also useful for graduate students who already have some background in the field.


Target audience


Research


Authors/Editors


Further information & material


Preface
Introduction
PART I. VECTOR SPACE MODEL IN THE ANALYSIS OF SIMILARITY BETWEEN TEXTS
Chapter 1. Formalization in computational linguistics
Chapter 2. Vector space model
Chapter 3. Vector space model for texts and the tf-idf measure
Chapter 4. Latent Semantic Analysis (LSA): reduction of dimensions
Chapter 5. Design of experiments in computational linguistics
Chapter 6. Example of application of n-grams: authorship attribution using n-grams of syllables
PART II. NON-LINEAR CONSTRUCTION OF N-GRAMS
Chapter 7. Syntactic n-grams: the concept
Chapter 8. Types of syntactic n-grams according to their components
Chapter 9. Continuous and non-continuous syntactic n-grams
Chapter 10. Metalanguage of syntactic n-grams representation
Chapter 11. Examples of construction of non-continuous syntactic n-grams
Chapter 12. Automatic analysis of authorship using syntactic n-grams
Chapter 13. Filtered n-grams
Chapter 14. Generalized n-grams


Grigori Sidorov is a full professor and researcher at the "Centro de Investigación en Computación" (Center for Computing Research, CIC), which is part of the "Instituto Politécnico Nacional" (National Polytechnic Institute, IPN) in Mexico City, Mexico.


