The Cost of Analytical Query Processing in the Cloud, Part 1

This is the first in the series of blog posts on the topic. I will look at database options and examine each in terms of data scalability, query processing capability, and cost.
Any major analytical database solution in the cloud needs to support large databases and complex analytical queries. To refine the [...]

Reflections on Data Redundancy and the Shades of Denormalization

Database Denormalization Can Help Boost Query Performance
Database normalization and denormalization is one of the most discussed and debated topics ([1], [2], [3]).    In a nutshell, “normalization is eliminating redundant data (for example, storing the same data in more than one table) and ensuring data dependencies make sense (only storing related data in a table).”   [...]

New England Database Day 2009 at MIT

On Friday I attended the second New England Database Day at Massachusetts Institute of Technology, an all day conference-style event where participants mostly from the research community in the New England area came together to present ideas and discuss their research.
The conference format included two keynote talks, a single track of six technical session [...]

The Importance of MySQL 5.1 GA Release

MySQL 5.1 beta release cycle started in March of 2006 with the 5.1.7 Release. Last week, 15 beta and 9 RC releases later, MySQL 5.1 GA came out.
Many blogs have been written and published about new MySQL features included in this release. Some other blogs discussed its engineering quality. Indeed, just go to the MySQL [...]

OpenSQL Camp 2008: the Present and the Future of Open Source Databases

Last weekend I attended OpenSQL Camp, which was held November 14-16 in Charlottesville, Virginia. Among about 100 attendees from many countries were open source legends and community leaders, database product architects, developers, hackers, consultants, bloggers, and DBAs.
Open SQL Camp was a great success and a pleasure to attend. It was truly an [...]