Questions
Table of Contents
Total questions: 524
01 Programming
.NET
When should you still target netstandard?
What is the pragmatic default for new libraries today?
Why are there no new .NET Standard versions after 2.1, and what does that mean for design decisions?
What are the three layers of the .NET platform, and why does that distinction matter?
What is the difference between .NET Framework and modern .NET?
Why does .NET consistently rank high in web framework benchmarks, and does that translate to application performance?
ASP.NET Web API
Where should authentication and authorization live in an ASP.NET Core API?
Why can't you revoke a JWT before its expiry?
When would you choose cookie authentication over JWT Bearer in an ASP.NET Core API?
What does ValidateIssuerSigningKey = true actually enforce?
When should you return 403 Forbidden vs 404 Not Found for an unauthorized resource access?
What is the difference between context.Succeed() and context.Fail() in an authorization handler?
How do you implement OR logic across two authorization policies on a single endpoint?
Does CORS protect your API from unauthorized access?
Why must app.UseCors() come before authentication middleware?
What is a captive dependency and why is it dangerous?
How do you use a Scoped service inside a Singleton without a captive dependency?
What is the difference between GetService<T> and GetRequiredService<T>?
Explain when to choose middleware over an MVC action filter.
What is the execution order of ASP.NET Core filter types?
How do you inject services into a filter safely?
Action filter vs middleware: what is the difference?
How can you log execution time for all requests?
How can you centrally catch errors for all requests?
What is the ASP.NET request processing pipeline?
CSharp
What makes C# a multi-paradigm language, and why does that matter for API design?
How does C#'s type system help prevent bugs at compile time?
Concurrency and Parallelism
How is asynchrony different from multithreading?
What is the difference between await and Task.Result?
When should you use ConfigureAwait(false)?
If async does not always use extra threads, why does it improve scalability?
When should you use Task.Run with async code?
When is it reasonable not to cancel immediately after the token is signaled?
Why is cooperative cancellation safer than Thread.Abort?
How do you propagate cancellation across a service boundary (e.g., HTTP call)?
What is the difference between concurrency and parallelism in practice?
Why do many production outages in .NET systems look like "performance" but are actually concurrency bugs?
What is the first decision before choosing a primitive (Task, lock, Parallel, Channel)?
Why is unbounded Task.WhenAll often a bad first choice when calling around 300 external APIs in one request?
For one requirement ("update shared state safely"), when do you choose lock vs SemaphoreSlim vs Channel<T>?
What are the four Coffman conditions and which is easiest to break in practice?
Why does calling .Result on a Task deadlock in a UI app but not in a console app?
How do you diagnose a deadlock in a production .NET service?
When is a named Mutex the right tool in .NET?
Why is Mutex often a poor default for web request hot paths?
What does AbandonedMutexException signal?
Why can adding more parallel workers reduce performance?
How do you decide MaxDegreeOfParallelism?
When should you avoid PLINQ?
Why can a parallel query be slower than sequential for small inputs?
When should you choose SemaphoreSlim over lock?
Why is a semaphore useful for fan-out HTTP calls?
What bug pattern most often breaks semaphore-based code?
Why is Task not equivalent to a thread?
When should Task.Run be used in ASP.NET Core?
Why is Task.WhenAll usually better than sequential await for independent calls?
When should you use ValueTask instead of Task?
What is ThreadPool starvation and how does it usually start?
Why can a fully async app still suffer ThreadPool issues?
When is it appropriate to call ThreadPool.SetMinThreads?
What is the difference between Task.Run and ThreadPool.QueueUserWorkItem?
Fundamentals
What is the difference between throw; and throw ex; inside a catch block?
When should you wrap an exception instead of rethrowing it directly?
Why is throwing from finally considered dangerous?
When might finally not execute?
What types can you use in foreach?
How is foreach implemented under the hood?
What is yield and how does it work?
Why and when should you use yield return instead of returning a materialized collection like List<T>?
Why does IEnumerable<string> assign to IEnumerable<object>, but List<string> does not assign to List<object>?
When should you mark a generic interface type parameter as out or in?
A generic method uses default(T) as a fallback value. Why can this be dangerous in production code?
Why might you need ref for reference types if reference types are already passed by reference?
What is an in parameter used for?
What are optional parameters in methods?
In a hierarchy where Animal a = new Dog();, how can you make a.Category() return the derived value, and what does that imply for API design?
When should you prefer new over override?
A base method is not marked virtual, but you need derived-specific behavior. What are your options?
When should you prefer file-scoped namespaces over block-scoped namespaces?
Why is reflection often a bad default in performance-critical code?
What is an attribute and why is reflection central to attribute-driven frameworks?
When should you choose reflection versus alternatives like interfaces, generics, or source generators?
Types
What is the difference between abstract class and interface with default interface methods (C# 8+)? When would you still choose an abstract class?
Can a static class implement an interface? Why or why not?
A sealed override stops further overriding, but can a derived class use new to hide the sealed method? What happens at runtime?
Can you have an abstract sealed class? What about in IL?
Why can partial be dangerous with source generators? Give a concrete scenario.
Two classes have identical fields and values. You compare them with ==. Why is it false, and what are the three ways to fix it?
A static constructor throws an exception. What happens on subsequent accesses to that type?
What does a delegate compile to in IL/runtime terms?
How do you isolate failures in a multicast delegate so one bad subscriber does not break the rest?
Why is using Func<Task> in multicast pipelines often wrong for async fan-out?
How is an event different from a delegate field in terms of access control?
Why is PriceChanged?.Invoke(this, e) preferred over if (PriceChanged != null) PriceChanged(this, e)?
Why do event leaks happen, and how do you prevent them?
Is EventArgs inheritance mandatory in modern .NET?
How do you handle exceptions in event subscribers without losing later handlers?
In record Wrapper(List<int> Items), if var b = a with { }; and an item is added to b.Items, does a observe the change, and why?
When would you choose record class over readonly record struct?
If Equals on a positional record is overridden to ignore one property, does GetHashCode still include that property, and what breaks?
Can a record struct be used as a Dictionary key safely? What do you need to watch out for?
When should you choose StringBuilder over string?
Why can ReferenceEquals(a, b) be false even when a == b is true for strings?
How should string comparisons be written in production code?
For a high-throughput service processing 100k messages per second (Id, Timestamp, Status), would class or struct be the better model, and why?
Why does mutating a struct returned from a property or indexer not compile (or silently do nothing)?
Why does ValueType.Equals perform so poorly on structs with reference-type fields? What should you do about it?
Why can updating a value-type item inside foreach fail to persist, and what are safe fixes?
Where does boxing usually sneak in, and what is the practical mitigation in production code?
What criteria should drive choosing between struct, class, and record class?
Other
What problem did OWIN solve?
Is ASP.NET Core OWIN?
How should incremental migration vs full rewrite be chosen when modernizing an OWIN app under strict uptime requirements?
When should you use SignalR versus polling or server-sent events?
Why would a .NET team still need to understand OWIN today?
When is SignalR a good fit?
What is the first scaling problem you will hit?
Why are SignalR groups not enough for authorization?
Runtime
What is managed vs unmanaged code? Why does unmanaged interop require careful lifetime management?
What is the CLR and IL? How does JIT compilation affect startup vs steady-state performance, and when is NativeAOT a better choice?
When would you choose NativeAOT over JIT compilation?
Why does the GC use generations?
What is a memory leak? Is it possible in .NET? How?
What is the call stack? Can it overflow? What happens then?
Why do we need using {} if there is a GC?
What are IDisposable and Finalize?
What is the disposable (dispose) pattern?
How does .NET's generational GC work? Why does it use generations, and what are the main tuning tradeoffs?
What are the Small Object Heap (SOH) and the Large Object Heap (LOH)?
What does the CLR do when your application starts, and why does startup behavior matter?
How does garbage collection affect production latency, and what are the main tuning levers?
Can managed code have memory leaks, and what are the common causes?
02 Computer Science
Why does Big O analysis matter for a senior developer who can just benchmark?
When does algorithmic complexity matter less than constant factors?
How do you decide between optimizing data structure choice versus algorithm choice?
Algorithms
What is an algorithm? How is its efficiency measured?
Why is Big O not enough to choose the fastest algorithm in practice?
What is the difference between worst-case, average-case, and amortized complexity?
Disjoint Set
What is path compression and why does it matter?
What is union by rank and how does it complement path compression?
How does DSU detect cycles in a graph?
Why combine path compression with union by rank?
What real problems are naturally modeled by DSU?
Graph Algorithms
Why does Dijkstra require non-negative edge weights?
What data structures make Dijkstra practical at scale?
When should you prefer A* over Dijkstra?
When do you pick BFS over DFS?
Why is Dijkstra not valid with negative edges?
What representation should you use for a graph: adjacency list or adjacency matrix?
Search Algorithms
Why does binary search require sorted data?
How do you find the first occurrence of a duplicated value?
Why is left + (right - left) / 2 preferred over (left + right) / 2?
What does the prefix function (LPS array) encode and why does it matter?
When is KMP worth the implementation complexity over naive search?
How does KMP compare to Rabin-Karp for single-pattern matching?
When should you choose BFS over DFS?
Why can recursive DFS cause stack overflow and how do you fix it?
How do you detect cycles using DFS in a directed graph?
How does the rolling hash achieve O(1) window updates?
Why is character-by-character verification necessary on hash match?
When should you choose Rabin-Karp over KMP?
What is the first decision before picking a search algorithm?
Why is one search algorithm never best for all cases?
Sorting Algorithms
Why is bubble sort never used in production?
What is bubble sort's one practical advantage?
When is insertion sort faster than O(n log n) algorithms?
Why is insertion sort used as the base case in Timsort?
Why is merge sort preferred over quick sort for linked lists?
When does merge sort's O(n) space cost become a real problem?
What causes quick sort's O(n²) worst case and how does introsort prevent it?
Why is quick sort faster than merge sort in practice despite the same O(n log n) average?
How do you choose between Merge Sort and Quick Sort in production?
When is Insertion Sort still a good choice?
What does .NET's built-in Array.Sort use, and why?
Why does selection sort make at most n - 1 swaps, and when does that matter?
Why is selection sort not stable, and how can it be made stable?
Data Structures
What is a data structure? Which ones do you know? Which of them exist in .NET?
How do you choose between List<T>, Dictionary<TKey, TValue>, and HashSet<T>?
Why does collection choice matter more than micro-optimization?
When would you use LinkedList<T> over List<T> in .NET?
What data structure is used behind Dictionary<TKey, TValue>?
Why is Dictionary usually faster than List for lookups?
How does hash collision affect performance?
Does .NET provide a built-in Graph<T> type?
Which collections are typically used for BFS?
What is the difference between HashSet<T> and List<T> for membership checks?
Why can HashSet<T>.Contains fail for logically equal objects?
How does inserting a value into a hashtable work?
Why does using a hash code instead of comparing full keys speed up lookups?
Why is Dequeue on a heap O(log n) instead of O(1)?
Why can heap iteration look unsorted even when the structure is valid?
What is the practical reason to use PriorityQueue<TElement, TPriority> in .NET?
Why can hash map performance degrade from O(1) to O(n)?
Why does a bad GetHashCode implementation create correctness and performance risk?
Which .NET type is the standard hash map in modern code?
Why is LinkedList<T> often slower than List<T> in real workloads despite O(1) inserts/removes?
When is LinkedList<T> the right choice in .NET?
What is a common migration signal from LinkedList<T> to List<T>?
How is List<T> implemented under the hood?
What is the difference between Count and Capacity in List<T>?
How do Clear() and Remove() affect Capacity in List<T>?
Why is Queue<T> suitable for BFS?
When should you replace Queue<T> with PriorityQueue<TElement, TPriority>?
Why can a queue become a production reliability problem even if operations are O(1)?
Why is Span<T> a ref struct?
When should you choose Memory<T> instead of Span<T>?
Does slicing a span allocate?
When is an explicit Stack<T> better than recursion?
Why can Stack<T> be a poor fit for work queues?
What is the complexity of Push and why is it not always constant in practice?
Which built-in .NET collection is closest to a self-balancing tree?
When would you avoid recursive tree traversal?
03 Data Persistence
What isolation level should you use for a read-modify-write transaction, and why?
How does write-ahead logging (WAL) implement durability?
What makes caching hard?
How do you reduce cache stampede?
How do you fix stale read-after-write behavior in Redis user-profile caching without turning off caching?
What changes reduce p99 spikes and database CPU saturation in TTL-only caching with synchronized five-minute expirations?
What should be checked first when a multi-tenant cache leaks one tenant's data to another, and how is it prevented?
NoSQL
Which NoSQL family fits a user profile API with very frequent reads by user id, and why?
When is NoSQL a bad idea?
ORMs
How does EF Core's change tracker work, and when should you disable it?
What is the N+1 query problem and how do you detect it?
SQL
What is an index and what types exist?
When should a column go in INCLUDE rather than the index key?
Why can an index rebuild appear to fix a slow query, and how do you verify whether statistics were the real cause?
What is normalization and why do most systems stop at 3NF/BCNF?
When would you denormalize a table, and what risks does it introduce?
What is the difference between 2NF and 3NF, and how would you recognize a violation of each?
What are the three replication lag anomalies, and how do you mitigate each?
When would you choose synchronous vs asynchronous replication?
How does split-brain occur and how is it prevented?
What is the difference between WHERE and HAVING?
What is a stored procedure and how is it different from a function?
What is a Common Table Expression (CTE) and when should you use a temp table instead?
What are SQL Server transaction isolation levels?
How do you diagnose and optimize a slow query?
When should you shard a database, and what should you try first?
04 Networks
Architecture & Ops
Why is consistency hard in P2P systems?
How does WebRTC use P2P, and what is the role of STUN/TURN servers?
What is split tunneling and when would you use it?
Why does WireGuard have a smaller attack surface than IPsec?
What causes DNS leaks in a VPN setup, and how do you fix them?
Protocols
Why does a DNS change "take time" to propagate?
What is the difference between a recursive resolver and an authoritative server?
How does DNSSEC protect against cache poisoning?
What problem does HTTP/2 multiplexing solve compared to HTTP/1.1?
Why does HTTP/2 still have head-of-line blocking?
When would you choose HTTP/1.1 over HTTP/2?
Why is creating a new HttpClient per request dangerous in .NET, and what are the two correct alternatives?
What is the difference between Cache-Control no-cache and no-store?
Explain the difference between 401 and 403 status codes.
A client sends POST /api/orders, times out, and retries. How do you prevent duplicate orders?
Explain PUT vs PATCH semantics and retry behavior.
Why do many teams stop at Richardson Level 2 instead of Level 3 HATEOAS?
Why is RPC's 'local call' abstraction considered leaky?
When should you choose gRPC over REST?
Why does SMTP require SPF, DKIM, and DMARC for deliverability?
Why is System.Net.Mail.SmtpClient deprecated in .NET?
Why does gRPC not work well with L4 load balancers, and how do you fix it?
What happens if you call a gRPC service without setting a deadline?
Why is renaming a proto field safe but renumbering it is not?
Transport & Sockets
Why do partial reads happen with TCP sockets?
What is the difference between Socket, TcpClient, and TcpListener?
When would you choose UDP over TCP for a production system?
Why does TCP's three-way handshake exist, and when does its latency cost become a real problem?
How do flow control and congestion control differ, and what happens when you confuse them?
When is UDP preferable to TCP, and what reliability mechanisms do applications add on top?
Why does UDP have no congestion control, and what are the consequences?
What is QUIC and why is it built on UDP rather than TCP?
05 Architecture
Application Architecture
What is the Dependency Rule and why does it matter?
What is the difference between traditional layered and Onion/Clean Architecture?
What is the key difference between MVC and MVVM?
Why is the ViewModel more testable than the Controller?
How do you prevent version conflicts between plug-ins that depend on different versions of the same library?
How do you version extension point interfaces without breaking existing plug-ins?
When is plug-in architecture the wrong choice?
Distributed Systems
How do you design gateway aggregation endpoints for client efficiency, and what do you explicitly keep out of the gateway?
When would you choose a single API gateway versus a BFF approach?
Explain API Gateway vs Service Mesh in one architecture and where each policy belongs.
Your service uses Postgres for orders and Redis for session cache. During a partition, how do they behave differently and why is that acceptable?
If CAP is only about partitions, why do we still tune consistency levels on healthy clusters?
A team says "our database is AP, so conflicts are fine." What follow-up would you ask?
How do you guarantee read-your-writes when writes go to a strongly consistent store but reads come from an eventually consistent cache?
Which chat features should use linearizable, causal, and eventual consistency, and why?
Why can linearizability reduce availability during a network partition, and how do you reduce blast radius in design?
Why is 2PC problematic in microservices?
How does the Outbox pattern guarantee at-least-once event delivery?
Your AI service has four instances behind an L7 gateway with passive outlier detection enabled. One instance returns HTTP 500 on about 30 percent of requests. How does the LB detect and handle this?
When is Azure Load Balancer the better choice than Azure Application Gateway for an ASP.NET Core workload?
Why can sticky sessions hurt autoscaling outcomes, and what architecture is safer?
How do you design webhook consumers to prevent event loss and duplicate processing?
When would you choose webhooks over a shared message broker for inter-service event delivery?
How do you protect a webhook endpoint against replay attacks?
Message Queues
You need to process order events in order per customer but handle 50K events per second. How do you design the Kafka topic?
Compare at-most-once, at-least-once, and exactly-once in Kafka. Which do you choose for payment events and why?
When is MSMQ still the right choice over RabbitMQ or Azure Service Bus?
Why is MSMQ not available in .NET 5+?
Scalability Patterns
What architectural prerequisites must be met before horizontal scaling works?
Why can horizontal scaling fail even with many instances?
Your system handles 1,000 RPS today. Product wants 10x. How do you decide what to scale first?
When would you choose read replicas instead of CQRS for a scaling problem?
When is vertical scaling the right first move over horizontal scaling?
Why does vertical scaling have diminishing returns at high scale?
What's the failure mode of relying solely on vertical scaling for availability?
Patterns
Why does CQS make code easier to reason about?
When is it pragmatic to violate CQS?
Your singleton cache service must use a scoped DbContext to warm data on startup. How do you solve this without captive dependency?
Explain Transient, Scoped, and Singleton lifetimes with one safe production example each.
Why is Service Locator an anti-pattern in business logic, and when is it acceptable?
Why does EF Core's DbContext already implement the Unit of Work pattern?
When is a generic IRepository an anti-pattern?
Architectural Patterns
When is CQRS worth the operational complexity, and when is it an anti-pattern?
What is the difference between an Entity and a Value Object?
Why should external code only interact with an Aggregate through its root?
You need a full audit trail for financial transactions. Compare Event Sourcing with a CRUD model plus audit-log table. When does Event Sourcing justify its complexity?
How do you evolve event schemas safely without breaking old streams?
Design Patterns
What is the Information Expert principle and why does it reduce coupling?
How does GRASP differ from SOLID?
You need to add a new robot type to an existing system without modifying existing code. Which patterns enable this?
Resilience Patterns
Your AI service calls an LLM API and sees intermittent 429 plus occasional 500. How do you configure the breaker and what do you do when it opens?
Why is retry placement relative to circuit breaker important?
How do you avoid a half-open thundering herd in Kubernetes-scale deployments?
Your AI service wraps OpenAI APIs with per-tenant limits and runs on 4 instances. How do you enforce limits accurately, and which algorithm do you choose?
Expected answer
- Use distributed shared state, usually Redis, because per-instance memory breaks global accuracy.
- Partition by tenant ID so quotas align with billing and fairness.
- Choose token bucket when tenants need controlled burst capacity with stable average throughput.
- Use atomic operations (Lua or transaction pattern) for refill and consume to avoid race conditions.
- Return 429 with Retry-After and remaining quota headers to support client backoff.
Why this question matters - It tests algorithm choice plus distributed systems correctness, not just definition recall.
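The token bucket bullets above can be sketched in a few lines. This is a minimal in-memory illustration in Python, used here only as neutral notation, and not a production limiter. The class name TenantTokenBucket, its parameters, and the shape of the returned dictionary are hypothetical. Across four instances the bucket state would have to live in shared storage such as Redis, with the refill-and-consume step executed atomically there.

```python
import math
from dataclasses import dataclass


@dataclass
class Bucket:
    tokens: float       # tokens currently available to the tenant
    updated_at: float   # timestamp (seconds) of the last refill


class TenantTokenBucket:
    """Hypothetical per-tenant token bucket; state is kept in process memory."""

    def __init__(self, capacity: int, refill_per_second: float):
        self.capacity = capacity                    # maximum burst size
        self.refill_per_second = refill_per_second  # long-run average rate
        self._buckets = {}                          # tenant_id -> Bucket

    def try_consume(self, tenant_id: str, now: float, cost: int = 1) -> dict:
        # Assumption: in a distributed setup this whole method body would be
        # a single atomic operation against shared state.
        bucket = self._buckets.setdefault(
            tenant_id, Bucket(float(self.capacity), now)
        )

        # Refill in proportion to elapsed time, capped at capacity.
        elapsed = max(0.0, now - bucket.updated_at)
        bucket.tokens = min(
            float(self.capacity),
            bucket.tokens + elapsed * self.refill_per_second,
        )
        bucket.updated_at = now

        if bucket.tokens >= cost:
            bucket.tokens -= cost
            return {"status": 200, "remaining": int(bucket.tokens), "retry_after": 0}

        # Not enough tokens: report how long until the deficit is refilled.
        deficit = cost - bucket.tokens
        retry_after = math.ceil(deficit / self.refill_per_second)
        return {
            "status": 429,
            "remaining": int(bucket.tokens),
            "retry_after": retry_after,
        }
```

With a capacity of 2 and a refill rate of 1 token per second, a tenant's third back-to-back request is rejected with status 429 and a retry_after of 1 second, while a different tenant's first request is still allowed.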
Rate Limiting
When would you prefer sliding window counter over fixed window in a public API?
Expected answer
- Prefer sliding window counter when edge fairness matters and fixed window boundary bursts are unacceptable.
- It gives near-rolling behavior with lower memory than sliding log.
- Accept approximation error in exchange for better operational cost.
- Keep fixed window only where simplicity dominates and traffic patterns are predictable.
Why this question matters - It checks whether the candidate can justify tradeoffs under realistic constraints.
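One common way to implement the sliding window counter is to weight the previous fixed window's count by how much of it still overlaps the rolling window. The sketch below assumes that approach. It is a single-process Python illustration, and the class and attribute names are hypothetical.

```python
class SlidingWindowCounter:
    """Hypothetical sliding window counter built from two fixed-window counts."""

    def __init__(self, limit: int, window_seconds: float):
        self.limit = limit
        self.window_seconds = window_seconds
        self._window_index = 0   # index of the fixed window being counted
        self._current = 0        # requests counted in the current window
        self._previous = 0       # requests counted in the preceding window

    def allow(self, now: float) -> bool:
        index = int(now // self.window_seconds)
        if index != self._window_index:
            # Roll forward. The old count only matters if the windows are
            # adjacent; after a longer gap nothing overlaps the rolling window.
            adjacent = index == self._window_index + 1
            self._previous = self._current if adjacent else 0
            self._current = 0
            self._window_index = index

        # Fraction of the current fixed window that has already elapsed.
        elapsed_fraction = (now % self.window_seconds) / self.window_seconds

        # Approximation: assume the previous window's requests were spread
        # evenly, so only the still-overlapping share counts.
        estimated = self._previous * (1.0 - elapsed_fraction) + self._current

        if estimated + 1 > self.limit:
            return False
        self._current += 1
        return True
```

With a limit of 10 per 60 seconds, a burst of 10 requests at second 59 still blocks a request at second 61, which a plain fixed window would admit because its counter resets at second 60. The cost is two counters per key and the even-spread approximation.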
Rate Limiting
What failure mode should you choose if Redis-based rate limiting is unavailable: fail-open or fail-closed?
Expected answer
- Decide by endpoint risk profile, not globally.
- Fail-open for low-risk endpoints when availability is the top priority.
- Fail-closed for sensitive operations where abuse or cost explosion is unacceptable.
- Document and test the behavior with chaos drills.
Why this question matters - It tests operational judgment and explicit risk tradeoff reasoning.
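Deciding the failure mode per endpoint rather than globally can be sketched as a small policy lookup wrapped around the limiter call. The Python below is an illustration only. The endpoint paths, the LimiterUnavailable exception, and the check_limit callable are hypothetical stand-ins for whatever the real Redis-backed limiter exposes.

```python
from enum import Enum
from typing import Callable


class FailureMode(Enum):
    FAIL_OPEN = "fail_open"      # allow traffic when the limiter is down
    FAIL_CLOSED = "fail_closed"  # reject traffic when the limiter is down


class LimiterUnavailable(Exception):
    """Hypothetical error raised when the rate limit store cannot be reached."""


# Hypothetical policy table: the risk profile is decided per endpoint.
ENDPOINT_FAILURE_MODE = {
    "/api/catalog": FailureMode.FAIL_OPEN,     # low risk, availability first
    "/api/payments": FailureMode.FAIL_CLOSED,  # abuse or cost is unacceptable
}


def is_request_allowed(
    endpoint: str,
    check_limit: Callable[[str], bool],
    default_mode: FailureMode = FailureMode.FAIL_CLOSED,
) -> bool:
    """Return the limiter's decision, or the endpoint's fallback if it is down."""
    try:
        return check_limit(endpoint)
    except LimiterUnavailable:
        # Unlisted endpoints use the default, chosen here to be the safer mode.
        mode = ENDPOINT_FAILURE_MODE.get(endpoint, default_mode)
        return mode is FailureMode.FAIL_OPEN
```

Keeping the policy in one table makes the behavior easy to document and to exercise in a chaos drill: inject a check_limit that always raises and assert which endpoints stay reachable.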
Rate Limiting
System Architecture
Order service publishes OrderPlaced. Payment and Inventory both consume it. Payment fails. How do you handle compensation without tight coupling?
When would you choose orchestration over choreography in an event-driven workflow?
How do you evolve integration event contracts without breaking consumers?
Why can microservices lead to distributed data consistency problems, and how do you address them?
Order and Inventory are separate microservices. A customer places an order. How do you ensure inventory is reserved without a distributed transaction?
How do you decide between monolith, modular monolith, and microservices for a new product?
What is a modular monolith and when is it better than microservices?
When do microservices become justified over a monolith?
How do you mitigate cold start latency in serverless functions?
How do you avoid vendor lock-in with serverless functions?
How do you model cost for a serverless workload vs a container-based one?
What is the key architectural difference between SOA and microservices?
When is SOA still the right choice over microservices?
06 Development Practices
Paradigms
How does event-driven architecture differ from event sourcing?
How do you guarantee ordering of events in a distributed event-driven system?
Why must event consumers be idempotent in an at-least-once delivery system?
What makes a function "pure" and why does purity matter for testing?
How does immutability prevent bugs in concurrent code?
When would you choose LINQ over a manual loop in production code?
What does WebApplicationFactory test that unit tests cannot?
When should you use Testcontainers instead of EF In-Memory?
When should you prefer composition over inheritance?
What is the Anemic Domain Model and why is it considered an anti-pattern?
How does polymorphism reduce conditional complexity?
Why does TDD improve design, not just test coverage?
When is TDD not worth the overhead?
How do you handle external dependencies (DB, HTTP) in TDD?
What is the difference between a stub and a mock?
How do you test code that depends on the current time?
When should you NOT write unit tests?
Principles
What is DRY actually trying to prevent?
When is it OK to repeat code?
What is the difference between IoC and Dependency Injection?
What is the difference between IoC and the Dependency Inversion Principle (DIP)?
How do you distinguish 'simple' from 'simplistic' in a design review?
When is complexity justified despite KISS?
How does KISS interact with YAGNI and DRY?
Which SOLID principles does a typical Singleton violate?
When is it acceptable to violate SOLID principles?
Does YAGNI mean you should skip tests and refactoring?
When does YAGNI conflict with good design, and how do you resolve it?
07 Security
When is blockchain justified over a traditional database?
Why does PoW waste energy and what is the alternative?
Why does signing use the private key to encrypt the hash, not the public key?
What is the difference between signing and encryption?
Why is ECDSA preferred over RSA for new systems?
When should you use symmetric vs asymmetric encryption?
What is envelope encryption and when is it used?
Why is key management harder than choosing the right algorithm?
What is a JWT token and why is it not encrypted by default?
Why should JWTs have short expiry times?
Which OWASP Top 10 item is most commonly found in production .NET apps?
How do you prevent SQL injection in a .NET application?
Authentication
Why is Basic Auth unsafe over HTTP?
When is Basic Auth acceptable in production?
What is the difference between OAuth 2.0 and OpenID Connect?
What is the difference between role-based and resource-based authorization?
Why inject IAuthorizationService instead of checking ownership in the controller directly?
What problem does SSO solve and what does it not solve?
What must you validate when accepting an ID token from an IdP?
When would you choose SAML over OIDC?
Why is FIDO2/WebAuthn more secure than TOTP?
When should you use TOTP vs FIDO2?
08 SDLC
When does BDD overhead outweigh its benefits?
How does BDD differ from TDD, and are they complementary?
What is 'specification by example' and how does it reduce requirement ambiguity?
Why are story points measured in relative complexity rather than hours?
What is the most common estimation mistake and how do you avoid it?
09 DevOps
When should you use GitHub Actions vs Azure DevOps Pipelines?
What makes a CI pipeline 'good' vs 'fast but unreliable'?
Why use multi-stage builds for .NET applications?
How do you prevent secrets from leaking into Docker images?
What is the difference between a Pod, Deployment, and Service?
Why do Kubernetes Secrets need additional protection beyond base64 encoding?
What causes CrashLoopBackOff and how do you debug it?
How do you diagnose an intermittent p95 or p99 latency bottleneck in a multi-service system?
When should you use RED vs USE in a distributed system interview design?
Why instrument with OpenTelemetry from day one instead of adding observability later?
Deployment Strategies
When would you choose canary over blue-green deployment?
What failure mode makes rolling deployments unsafe for database schema changes?
How do you decide between A/B testing and canary when both are available?
Version Control Systems
Why does GitFlow create integration problems for teams practicing continuous deployment?
What does trunk-based development require that makes it unsuitable for some teams?
10 Cloud
What is the key operational difference between IaaS and PaaS?
Where does CaaS fit between IaaS and PaaS?
11 AI & ML
LLM
How can Matryoshka dimensionality reduction lower embedding storage costs without significant recall loss?
Why can switching to a higher-scoring embedding model cause recall to drop on existing queries?
When is domain-finetuning the embedding model justified over improving chunking or retrieval?
Why is adjusting temperature and top_p simultaneously discouraged?
Why can a grounded response still contain unsupported claims despite citation tags?
When should constrained decoding be preferred over JSON mode for structured output?
What is the minimum useful guardrail set for a production LLM application?
Can you rely on a safety filter alone?
How do you test guardrails?
Why can RAG-grounded systems still hallucinate significantly?
Why does RLHF increase hallucination risk while perceived quality improves?
How do you separate retrieval failure from generation hallucination in a RAG pipeline?
Why is prompt injection fundamentally harder to prevent than SQL injection?
How does Excessive Agency (LLM06) compound with Prompt Injection (LLM01)?
Why should system prompts be treated as public rather than secret?
Agents
Why does the ReAct pattern outperform chain-of-thought reasoning alone for tasks requiring external knowledge?
What are the three most critical production safeguards for an agent loop?
When is a simple prompt chain preferable to an agent loop?
When should you use a workflow instead of an autonomous agent?
Why does Anthropic recommend starting with the simplest possible solution?
Why does MCP use a host-client-server architecture instead of letting the LLM talk to servers directly?
When would you choose function calling over MCP even for a shared tool?
What makes MCP servers harder to secure than traditional REST APIs?
When is multi-agent coordination justified over a single agent with more tools?
Why does context-centric decomposition outperform problem-centric decomposition?
What makes multi-agent failures harder to diagnose than single-agent failures?
Why is tool design often more impactful than prompt engineering in agentic systems?
How would you decide between one broad tool and many narrow tools?
What makes a tool safe for caching in an agent loop versus unsafe?
Evaluation
Can deterministic checks replace LLM judges?
What is the minimum useful set of deterministic checks for a tool-using agent?
How do I know if I'm overfitting my prompt?
What should I evaluate first?
When are classic metrics (BLEU/ROUGE) useful?
How big should the golden set be?
How do I choose what to target first?
When should I prefer LLM-as-a-judge over classic metrics, and how do I know the judge is trustworthy?
When should I prefer pairwise comparisons over rubric scorecards?
What are the most dangerous pitfalls when using LLM-as-a-judge in production?
Should I rely on online metrics only?
How do you determine the minimum sample size for an A/B test?
What is the difference between data drift and model degradation in online evaluation?
Prompting
When is automated prompt optimization worth the setup cost?
What is PAL's key insight compared to chain-of-thought prompting?
When should you start with zero-shot versus few-shot?
What makes a good few-shot example set?
What are the main failure modes of few-shot prompting?
When should you choose prompt chaining instead of a single large prompt?
What is the main risk of generated knowledge prompting, and how do you mitigate it?
What is a practical meta prompting workflow for improving a weak prompt?
Why do prompt anatomy and model settings have to be designed together?
When should you prefer few-shot prompting over pure instruction prompting?
How would you debug a prompt that is accurate but too verbose and expensive?
When does Chain-of-Thought usually help, and when can it hurt?
Explain the cost vs. accuracy tradeoff in self-consistency.
When is Tree of Thoughts justified over CoT or self-consistency?
RAG
Why should retrieval cache keys be based on processed query text instead of raw embeddings?
Why is response caching riskier than embedding caching?
When is semantic caching safe to deploy, and when should it be avoided?
Why does parent-child chunking often improve answer completeness over child-only retrieval?
When should a team move from recursive to structure-aware chunking?
Why is semantic chunking not always superior to simpler rule-based approaches?
Why can aggregate retrieval metrics improve while individual user segments degrade?
Why are relative regression thresholds preferable to absolute quality targets for RAG release gates?
When should a team invest in building a human-annotated golden evaluation set versus relying on synthetic QA generation?
Given high Faithfulness (0.91) and low Context Recall (0.54), which pipeline layer do you fix first and why?
Why is sampled LLM-as-judge scoring preferred over scoring every response in production?
Why should RAG alerting use relative regression thresholds instead of absolute quality targets?
How does monitoring differ from evaluation in a RAG system, and why do you need both?
Why should advanced RAG patterns be introduced incrementally instead of all at once?
When is Graph RAG a better fit than plain vector retrieval?
How does Adaptive RAG reduce cost without sacrificing accuracy on complex queries?
Why does query translation often improve recall but sometimes hurt precision, and how do you detect the tradeoff?
When is decomposition a better choice than multi-query, and when does it hurt?
Why can HyDE outperform direct query embedding for vague questions but fail on specific factual queries?
Why can reranking improve offline nDCG without visible quality improvement for end users?
When does reranking hurt retrieval quality instead of helping?
Why is ColBERT faster than a cross-encoder at query time despite also using token-level scoring?
Why can vector-only retrieval underperform on technical support workloads?
When does hybrid retrieval perform worse than single-mode retrieval?
Why does HNSW recall degrade silently as the vector database grows?
Machine Learning
What is the difference between data drift and concept drift?
How do you detect drift when labels are delayed?
How should an ML pipeline be designed to ship weekly batch churn scoring now while preserving a path to real-time scoring later?
What should be checked first when a binary classifier shows 98 percent accuracy but support tickets rise, and which metric should be optimized next?
What tradeoffs and rollout plan are appropriate when the best model exceeds a strict API latency budget?
When should you fine-tune a small model instead of prompting an LLM for an NLP task?
Why do multilingual NLP models underperform monolingual ones, and when is that acceptable?
Why start every new AI feature in Shadow Mode?
What signals indicate it is safe to move from Partial to Full Automation?
Evaluation
When should you optimize precision vs recall, and how do you operationalize the choice?
How do you pick a classification threshold in practice?
Your model has F1 = 0.78. The PM says it's "good enough." What questions do you ask?
When should you distrust a single evaluation metric?
What does ROC-AUC measure?
When should I use PR-AUC instead of ROC-AUC?
Why can high accuracy be misleading on imbalanced data?
Tooling
Why is an agentic coding tool not just "better autocomplete"?
When should a team prefer terminal agents over IDE agents?
What controls reduce production risk when adopting coding agents?
Why do hierarchical instruction files often outperform a single giant root file?
What is the fastest way to detect that an instruction file is harming agent quality?
When should a team keep instructions minimal and rely more on repository inspection?
Why should destructive-operation controls live in PreToolUse rather than PostToolUse?
How do you design hook pipelines that improve quality without making the agent unusably slow?
When should you use an LLM-based hook instead of a deterministic script?
Why does MCP reduce integration complexity for agent tooling ecosystems?
When should a team intentionally limit plugin count even if more integrations are available?
Why do skills usually improve agent reliability more than repeating instructions in every prompt?
When should you choose project-scoped skills over user-global skills?
How do you choose between a terminal-first coding agent and an IDE-first coding agent?
Why are agent instructions and hooks more important than model quality in long-running repos?
What is the practical difference between a coding agent and a code review agent?