Databases and Web Services[ Course mailing list | Lecture slides | Assignments | Exams | Projects | Literature | Extra links ]
Course topics: database services, database design (ER, UML), SQL, relational design theory, transaction management, web applications, XML.
This course introduces into (relational) database systems in theory and practice, with special emphasis on Web-based applications. In the accompanying project work, design and implementation of database and Web service components will be addressed, culminating in a sample database-enabled Web service based on the LAMP open source package. This course is recommended for all students specializing in computer science.
The course requires basic knowledge about algebraic expressions and laws, basic data structures like trees, object-oriented concepts, as well as - for the lab work - HTML and Linux.
Not a CS major? Then contact me upon semester start! Maybe you want to get specific support in the programming aspects.
Course mailing list
If you attend the course, make sure you are subscribed to that list to not miss important information!
[ Introduction and Project intro | Database design | The relational model | SQL | Relational Algebra | Query processing | Database application development | Web services | 3-tier architectures | Web Service protocols | Normal Forms & Physical database design | Security | Transactions | NoSQL | XML, XPath, XQuery | RDF, SPARQL | Array databases | OLAP | MapReduce | Wrap-up ]
...plus the talk given by Keith Hare, Convenor of the ISO SQL Working group.
Assignments will be published weekly (either here or via the course mailing list), they consist of both standalone tasks and workpackages of your projects. Assignments have to be handed in one week later by email as PDF or plain ASCII file to the teaching assistant. Late submissions will be disregarded; the only (and rare) exceptions to this are (i) registrar's exemption and (ii) case-by-case arrangement with me in advance with a Good Reason.
Several tasks require you to submit an individual answer, even though it may be part of the project. In such cases you may want to develop some solution within your team first, and then create your answer sheet by adapting the approach to your individual situation.
For the diagramming tasks, you may want to use dia.
Should you have questions on the outcome of an assignment, please contact the Teaching Assistants - they are happy to assist and explain!
For the rasdaman exercises you find instructions and exercises.
During the course you will have to implement your own Web service, based on Linux, Apache, Python, MySQL ("LAMP").To this end, you are expected to team up in groups of 2 - 3 early in the semester.
Use the ClamV environment (how to set up my own Web pages directory). Note that this excludes setting up your own environment, eg, on some Windows server! And, no, we tentatively do not use advanced tools like content management systems, because we want to learn about the internals of that technology.
I will suggest some topics, but it is highly appreciated if you come up with your own idea, maybe even for some service that will have its sustained life afterwards. It just needs to fulfil some criteria to make it manageable (not trivial, not too complex), so contact me early with your idea!
The main evaluation criteria for the website you will develop are as follows (in no particular order; authoritative list on course slide deck 00):
Further database material:
Database journals and conferences
Databases and the Web:
For the Unified Modeling Language (UML) there is many good tutorials available on the Web, for example
Sparx Systems' tutorial by
with a particular part on
Database Modeling in UML.
There is also a list of UML tutorials maintained by uml.org itself.
Some LAMP links:
The LAMP page by O'Reilly.
LAMP, as you know, stands for..., well, at first: Linux. Compare this and that site! Sometimes helpful: TLPD - The Linux Documentation Project.
Then, about the rest:
For PHP, there is some discussion of Apache configuration aspects, also touching security issues. This input sanitizing script might save your life one day.
Tuning of databases is something only marginally addressed in lecture, albeit a large (and fascinating) area, and of immense practical importance. Just two tools here: Practical query analysis produces HTML reports on slowest queries, most frequent queries, queries by type (select/insert/update/delete), and all that sort of thing for PostgreSQL and MySQL database logs. Using a template postgresql.conf and test data, pgAutotune will run through a process of testing different settings on your PostgreSQL server and determine which settings have the best performance. For additional material, see e.g. Software Engineering for Internat Applications.
For XML there is Altova's widely used XML spy; they also offer an XQuery tool. (note: the download script is broken, you need to manually ftp libxslt-1.1.13.tar.gz and libxml2-2.6.18.tar.gz from ftp://xmlsoft.org/libxslt). Books? Look, e.g., at XML.com!
Given the large variety of Web app implementation languages available it is no surprise that there is hot discussions on which platform is the best one. One line of arguments, which I share to a large extent (but not completely) is presented by two webcasts of a NASA/JPL guy (webcast1, webcast2).