251-0376-00L Data Warehousing
News
(August 16, 2009) Exam results are now available hereExams can be reviewed on Friday, August 21 and Tuesday, August 25
between 2 and 4 pm.
Contents
This course presents the principles of decision support systems and so-called online-analytic processing (OLAP). The course will cover the following topics:
1. Motivation and Architecture
2. Modelling for OLAP: Star Schemas and Snow Flake Schemas
3. SQL Extensions for Decision Support (Cube, Aggregation sets, Pivot,
etc.)
4. Demo of a Business Intelligence Tool (BTell)
5. Implementation Techniques (bitmap indexes, materialized views, star
joins, ...)
6. Parallel Databases (query execution strategies)
7. Data mining (classification trees, association rule mining, outlier
detection)
8. Data cleaning, identity reconciliation
9. Probabilistic databases, data lineage
10. Dataspace Management
11. Industry talks
Course Material
- Architecture, SQL Extensions
- Implementation Techniques (PDF, 1-page) (last update: March 20, List of new slides)
- Implementation Techniques (Parallel Databases) (PDF, 1-page)
- Data Integration
- Probabilistic Databases [PPT] [PDF]
- Data Mining
[PPT] [PDF]
Exercise Sheets
| Ex1: Star Schema & Summarizability. | Solution | fixed answer to 1.1 Q3 and alternative |
| Ex2: Advanced SQL. | Solution | |
| Ex3: How to implement Top K and Skyline. | Slides | |
| Ex4: Bitmap Indexes. | Solution | |
| Ex5: CSS- & CSB-Trees and Implementation Techniques applied. | Papers: CSS-Tree, CSB-Tree. Skeleton. Solution. | Fixed queries of Ex 5.2. |
| Ex6: Parallel Query Processing & Data Integration. | Solution |
|
| Ex7: Probabilistic Databases. | Solution |
|
| Ex8: Data Mining. | Slides |
|
| Rehearsal Slides. |
Schedule (2V1U)
- May 8th: Usual drill. Lecture: Data Mining, Exercise Session: Probabilistic Databases.
- May 15th: Industry Talk. At 9:15 in the usual lecture room (CAB G51) we have an exercise session (Topic: Data Mining). At 10:30 (location TBA) Franz Faerber from SAP will give a talk.
- May 22th: Rehearsal. At 9:15 in the usual lecture room (CAB G51) we will summarize the lecture and discuss past exams. Please send your questions to the TA as soon as possible. No exercise session.
- May 29th: Exam. The exam takes place in the usual lecture room (CAB G51). The duration will be 60 minutes. No exercise session.
| Friday | 9am-11am |
CAB G 51 |
Exercise Groups:
| Friday | 11am-12pm | CAB G 56 |
| ( Friday | 12pm-1pm | CAB G 56 *) |
* The second exercise group is cancelled!
Please contact the TA if you cannot attend the
session at 11am.
Exam
The exam will be on Friday, May 29 from 9am to 11am.
References
Articles of the research literature on databases and information systems (e.g., SIGMOD and VLDB). Exact references will be given in the lecture notes.
Lecturers
Dr. Carsten Binnig
Language
English
Requirements
ISG (Introduction to database systems, ER/UML modelling, normalization, SQL). Architecture & Implementation of Database Systems course is very helpful.



