Semester 6 Exam
Required Output Classification
| Required output | Classification | Public/private guidance |
|---|---|---|
| Timed written answers, diagrams, code snippets, and design responses | Checkpoint evidence | Keep raw exam work private so it remains useful for assessment and retake calibration. |
| Post-exam review notes, missed-answer repairs, and Feynman explanations | Practice artifact | Use for spaced review; publish only rewritten explanations that no longer reveal exam solutions wholesale. |
| Capstone-defense or architecture-defense packets created from exam prompts | Portfolio candidate | Polish publicly only when they are original to your project, sanitized, and framed as engineering rationale rather than exam answers. |
This exam checks whether you can reason across the full data stack: schema, storage, transactions, and distributed tradeoffs. Treat it as closed-book on the first pass.
Rules
- Suggested duration: 2.5 hours
- Pass 1: closed book
- Pass 2: open notes only for self-correction and citation repair
- Allowed artifacts during grading: your semester notes, local book chunks, schema drafts, and project evidence
Section A: Short Answer and Definitions
Answer in 4-8 sentences each.
- Explain why relational modeling is still valuable even in systems that later add caches, queues, or search indexes.
- Define index selectivity and explain why it matters to query performance.
- Compare replication for availability with partitioning for scale. What different problems are they solving?
- Explain one transaction anomaly and one way a database system prevents it.
- Why is partial failure the default mental model for distributed systems rather than an edge case?
Section B: Applied Data Modeling and SQL
Use a commerce, ticketing, or learning-platform domain of your choice.
- Design a relational schema with at least five tables, appropriate keys, and at least three integrity constraints. Explain two modeling decisions that reduce future bugs.
- Write:
- one join-heavy query
- one aggregate query
- one query that benefits from a carefully chosen index
For each query, explain what you expect the engine to do and what would make the query slow.
Section C: Storage Engines and Indexing
- You have a workload with heavy writes, periodic compaction pressure, and large range scans for analytics. Compare a B-tree-oriented approach with an LSM-oriented approach and defend your choice.
- A team wants to add indexes to fix performance complaints. Describe the process you would use to decide whether the problem is indexing, schema shape, query formulation, or workload mismatch.
Section D: Transactions and Consistency
- A booking system occasionally double-allocates seats during peak load. Describe:
- the likely class of anomaly
- two possible fixes
- the cost of each fix
- Your product team wants read-your-writes behavior after profile updates, but the system uses replicated reads. Explain the issue and propose a design approach that satisfies the requirement.
Section E: Distributed Systems Reasoning
- A service is partitioned across regions. During a network event, each side can still serve some traffic. Explain the tradeoffs between continuing with reduced guarantees and halting writes until coordination is restored.
- Describe how time, retries, and duplicate delivery can interact to produce correctness bugs in a distributed workflow.
Section F: Cross-Cutting Engineering
- Define a test strategy for a schema migration that touches application code, background jobs, and reporting queries.
- List the observability and security controls you would require before trusting a production data service with sensitive records.
Scoring Rubric
| Section | Max points |
|---|---|
| A | 20 |
| B | 20 |
| C | 15 |
| D | 15 |
| E | 15 |
| F | 15 |
Suggested interpretation:
85-100: strong control of Semester 6 material70-84: ready to proceed with a focused repair listBelow 70: review modules before advancing
What Strong Answers Show
- mechanism, not slogans
- explicit tradeoffs, not one-sided claims
- evidence that storage, correctness, and operations are connected
- language another engineer could review and trust
Mastery Rubric
| Level | Evidence |
|---|---|
| Beginner pass | Can answer direct questions and complete familiar exercises with light notes. |
| Solid pass | Can solve new variants, explain choices, and connect the work to Semester 5 Operating Systems and Networking. |
| Strong pass | Can defend tradeoffs, identify failure modes, and produce clean evidence in the portfolio artifact. |
| Not ready | Relies on copied solutions, cannot explain mistakes, or lacks durable artifacts. |
Retake and Repair Rule
If a section is weak, do not only reread. Repair it by producing new evidence: a corrected solution, a fresh implementation, a rewritten proof, a benchmark, a diagram, a runbook, or a short teaching note.
Answer-Quality Examples
Use these examples when grading written answers or spoken explanations.
| Quality | Example pattern |
|---|---|
| Weak | Names a concept but gives no example, constraint, or failure case. |
| Acceptable | Defines the concept and applies it to a familiar exercise. |
| Strong | Applies the concept to a new variant and explains why an alternative would fail. |
| Portfolio-ready | Connects the concept to Semester 5 Operating Systems and Networking, current project evidence, and a future capstone decision. |
Interleaving Prompt
For any missed answer, add one sentence starting with: This depends on an earlier skill because...
Calibration Materials
Use these learner-visible calibration materials before self-grading or requesting review: