rwebdb 23-September-2010

NCES recently released some of the 2009 IPEDS survey files. I added the 2009 institutional directory file to the warehouse, but in the process I also dealt with documentation issues, the internal organization of data files and program files, ways to synchronize the warehouse with the IPEDS release files, initial steps toward automating some warehouse procedures, and even some way-too-early considerations of a public domain open data license and the benefits of using GitHub for parts of this project. Very fun.

rwebdb 20-September-2010

This entry covers two warehouse design principles, three verifications of data against published external sources, one new warehouse variable, two design traps, an issue with the release timing of public IPEDS data sets, and source synchronization as the next step.

rwebdb 14-September-2010

So far, so good. The initial build stage is complete. Six new variables were added to the warehouse. An initial verification of warehouse data was completed against sources published by the National Center for Education Statistics. Several new to-do items remain for the second build phase of this project.

rwebdb 03-September-2010

The benefit from metadata and generalized utility programs can be immense. But first you need to hit a critical point where the number of utility tools is sufficient to do most of what needs to happen even in novel data situations. This week I had my first taste of that critical point for this project. It was a good feeling.