I think this is a good idea for a project. Jeremy will be presenting many learnings from KernelTrap and the CivicSpace ASP at the Drupal Scalability workshop on Saturday. The Drupal community has learned a lot; in particular, Gerhard has captured and tested significant performance and scalability improvements while building the Drupal.org infrastructure.
Having an intern document all these learnings would be a great asset for the community and would help the adoption of Drupal by businesses that ultimately ask the question, "Will Drupal scale?"
CivicSpace has learned a lot about high availability and scalability in building our ASP and we'd be interested in sharing what we've learned, which we have documented fairly well. I'd be happy to make much of this available if we believed that work would receive good stewardship.
Topics to consider:
Scaling
1) Scaling web servers horizontally. Shared file systems for horizontally scaling web servers.
2) Scaling databases.
3) Performance tuning queries. Configuring databases for load characteristics. Managing IO bottlenecks.
4) Logging - managing logs for scalability and IO tuning
High Availability
5) Network availability - Bonding, CARP
6) Database failover - replication with master and slaves
7) Restoration - text and binary database restoration, remote recovery
I don't think there is anything wrong with doing lots of research and then writing small amounts of code for tools where appropriate. My suspicion is that a well-prepared project would win wide support, and Google will support a project that has the support of the mentoring organizations.
Cheers,
Kieran