Book, English, 162 pages, Format (W × H): 160 mm x 241 mm, Weight: 453 g
Series: The Springer International Series in Engineering and Computer Science
ISBN: 978-0-7923-7507-4
Publisher: Springer US
In this book, we study two closely related steps in any knowledge discovery system: the generation of discovered knowledge, and the interpretation and evaluation of that knowledge. In the generation step, we study data summarization, where a single dataset can be generalized in many different ways and to many different levels of granularity according to domain generalization graphs. In the interpretation and evaluation step, we study diversity measures as heuristics for ranking the interestingness of the generated summaries.
The objective of this work is to introduce and evaluate a technique for ranking the interestingness of patterns discovered in data. The work has four primary goals:
- To introduce domain generalization graphs for describing and guiding the generation of summaries from databases.
- To introduce and evaluate serial and parallel algorithms that traverse the domain generalization space described by the domain generalization graphs.
- To introduce and evaluate diversity measures as heuristic measures of interestingness for ranking summaries generated from databases.
- To develop the preliminary foundation for a theory of interestingness within the context of ranking summaries generated from databases.
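As a rough illustration of the idea described above, the following sketch generalizes a toy dataset to three levels of granularity and ranks the resulting summaries with a diversity measure. Everything in it is an assumption made for illustration: the data, the city-to-province mapping, the names `GENERALIZATIONS`, `summarize`, `shannon_entropy`, and `rank_summaries`, the use of Shannon entropy as the diversity measure, and the choice to rank higher entropy first. The listing does not say which measures the book uses or how it defines domain generalization graphs; here a graph is reduced to a simple chain of value mappings.

```python
from collections import Counter
from math import log2

# Hypothetical toy data: (city, product) rows. Not taken from the book.
ROWS = [
    ("Regina", "laptop"), ("Regina", "phone"), ("Saskatoon", "laptop"),
    ("Toronto", "phone"), ("Toronto", "phone"), ("Ottawa", "laptop"),
]

# Assumed generalization hierarchy for the city attribute: city -> province -> ANY.
CITY_TO_PROVINCE = {
    "Regina": "Saskatchewan", "Saskatoon": "Saskatchewan",
    "Toronto": "Ontario", "Ottawa": "Ontario",
}

# Each entry maps a city value to its generalized value at one level.
GENERALIZATIONS = {
    "city": lambda city: city,
    "province": lambda city: CITY_TO_PROVINCE[city],
    "any": lambda city: "ANY",
}


def summarize(rows, generalize):
    """Generalize the city attribute, then count rows per resulting tuple."""
    return Counter((generalize(city), product) for city, product in rows)


def shannon_entropy(summary):
    """Shannon entropy (in bits) of the tuple-count distribution of a summary."""
    total = sum(summary.values())
    return -sum((n / total) * log2(n / total) for n in summary.values())


def rank_summaries(rows):
    """Score every generalization level and sort by entropy, highest first.

    Ranking higher entropy first is an illustrative assumption only.
    """
    scored = [
        (level, shannon_entropy(summarize(rows, generalize)))
        for level, generalize in GENERALIZATIONS.items()
    ]
    return sorted(scored, key=lambda pair: pair[1], reverse=True)


if __name__ == "__main__":
    for level, score in rank_summaries(ROWS):
        print(f"{level:10s} {score:.4f}")
```

Swapping `shannon_entropy` for another function of the count distribution changes the ranking criterion without touching the summarization step, which is the separation between generation and evaluation that the description draws.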
Target audience
Research
Authors/Editors
Subject areas
- Interdisciplinary > Sciences > Sciences: Research and Information > Information Theory, Coding Theory
- Mathematics | Computer Science > Data Processing | Computer Science > Data / Databases > Information Theory, Coding Theory
- Mathematics | Computer Science > Data Processing | Computer Science > Computer Science > Artificial Intelligence > Knowledge-Based Systems, Expert Systems
- Mathematics | Computer Science > Data Processing | Computer Science > Data / Databases > Character and Number Representations
- Engineering Sciences > Electronics | Communications Engineering > Electronics > Robotics
- Mathematics | Computer Science > Data Processing | Computer Science > Computer Science > Logic, Formal Languages, Automata
Further Information & Material
1. Introduction.- 2. Background and Related Work.- 3. A Data Mining Technique.- 4. Heuristic Measures of Interestingness.- 5. An Interestingness Framework.- 6. Experimental Analyses.- 7. Conclusion.- Appendices.- Comparison of Assigned Ranks.- Ranking Similarities.- Summary Complexity.