
Microsoft Interview Questions — What Level 60, 61, 62, and 63 Candidates Actually Get Asked

Microsoft does not run the rigid, single-format loop you get at Google or Meta. Each round mixes coding with behavioral, the question style shifts noticeably between the Azure, M365, Xbox, and Bing org branches, and the level you target — 60 (SDE), 61–62 (SDE II), 63 (Senior) — changes which signals matter most. This page walks through the loop structure and the real questions asked in each round.

The level system, and why the loop feels different at each band

Microsoft uses an internal level number rather than the L3/L4/L5-style ladder you see at Google. Levels 59 and 60 are the entry SDE band (59 is the typical new-grad landing point), 61 and 62 are SDE II — the band most experienced industry hires land in — and 63 begins the Senior band. The same loop format is used across the bands, but the bar moves. At 60, an interviewer may be satisfied with a working brute-force solution and a discussion of the optimization. At 62, they expect the optimized solution first, with the trade-off table volunteered before they ask.

The other variable is the team. Microsoft hires into a specific org — you are not abstracted into a hiring pool the way you are at Google. The Azure compute team will lean on systems-level questions about networking, schedulers, and concurrency. The M365 team leans on collaborative-editing and sync problems. Xbox leans on real-time and graphics-adjacent topics. The shared substrate is what Microsoft calls the “as appropriate” framework — interviewers are coached to evaluate candidates against the role they are being hired into, not against a one-size-fits-all rubric. Practically, this means a strong 60 candidate hired into a research-heavy team will get pushed harder on theory than the same candidate hired into a tooling team.

The cultural overlay across all bands is the “Growth Mindset” framing Satya introduced — “learn it all, not know it all.” This is not a slogan in the loop; it is a measured signal. Behavioral questions about feedback, failure, and learning are scored against it. Candidates who present as flawless tend to land below candidates who narrate a real learning arc.

How a Microsoft interview loop is structured

A standard Microsoft on-site is four to five rounds, each forty-five to sixty minutes. Unlike the strict separation Google enforces between “coding round” and “behavioral round,” most Microsoft rounds are mixed: roughly thirty to forty minutes of coding plus ten to fifteen minutes of behavioral, asked by the same interviewer back-to-back. This means the same person sees both your problem-solving and your self-narration in one sitting, and your scorecard reflects both.

A typical loop looks like this. The phone screen is one coding problem plus a few collaboration questions. On-site rounds one and two are mixed coding and behavioral. Round three is system design (added at level 61 and above). Round four is an “as appropriate” round — sometimes a deep-dive on your past projects, sometimes a second system design, sometimes a domain-specific technical conversation, depending on the team. Round five is the hiring manager: heavily behavioral with a smaller technical component, and the round where team-fit is decided.

The team-matching phase is unique to Microsoft. After you clear the loop, you are not auto-placed; recruiters set up calls with two or three teams that have headcount, and you and the team both pick. This means a strong loop result does not guarantee an offer in the org you wanted, and a weak round on the loop can still be salvaged if a specific team wants you. It is closer to an industry-style placement than a centralized hire.

Coding round — real questions and structured answers

Microsoft tends toward classic CS — arrays, strings, trees, linked lists — rather than the highly contrived DP problems you sometimes see at Meta. Interviewers often pull from a known internal bank, and the bar is “optimal solution, clean code, edge cases handled, complexity stated.”

Q. Reverse a linked list in place — recursive and iterative. Which one would you submit and why?

The iterative version is the safer answer. Walk three pointers — prev, curr, and next — flipping curr.next to prev as you advance. It runs in O(n) time and O(1) auxiliary space, which is what the interviewer is looking for. The recursive version is elegant but pushes n stack frames, which a Microsoft interviewer will probe on: 'What happens if this list has ten million nodes?' Stating the trade-off out loud — 'I'd ship iterative because the recursion blows the stack on long lists' — is the kind of pragmatic reasoning the loop rewards. If they push for recursion anyway, write it cleanly: base case returns the node, recurse on next, then point next.next back at current and null out current.next.
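A minimal iterative sketch in Python (the `ListNode` shape and the function name are the usual interview conventions, not anything Microsoft-specific):

```python
class ListNode:
    def __init__(self, val, nxt=None):
        self.val = val
        self.next = nxt

def reverse_list(head):
    """Iterative in-place reversal: O(n) time, O(1) auxiliary space."""
    prev, curr = None, head
    while curr:
        nxt = curr.next      # save the remainder before we flip
        curr.next = prev     # reverse this node's pointer
        prev, curr = curr, nxt
    return prev              # prev ends up as the new head
```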

Q. Given a string, find the length of the longest substring without repeating characters.

Sliding window with a hash map of character to last-seen index. Track a left pointer and walk right across the string. When you see a character whose last-seen index is at or after left, jump left to one past that index. Update the answer with right minus left plus one on every step. This is O(n) time and O(min(n, alphabet)) space. Microsoft loves this question because it tests four things in one shot: window invariants, off-by-one discipline on the left jump, choice of data structure, and whether you instinctively reach for a hash map versus an array of size 128 for ASCII. Mention the ASCII optimization at the end — interviewers note it.
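A sketch of that sliding window in Python (the function name is illustrative):

```python
def length_of_longest_substring(s: str) -> int:
    last_seen = {}   # char -> index of its last occurrence
    left = 0         # left edge of the current duplicate-free window
    best = 0
    for right, ch in enumerate(s):
        # Repeat inside the window: jump left past the previous occurrence.
        if ch in last_seen and last_seen[ch] >= left:
            left = last_seen[ch] + 1
        last_seen[ch] = right
        best = max(best, right - left + 1)
    return best
```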

Q. You are given a binary tree. Return its level-order traversal as a list of lists, one list per level.

BFS with a queue, but the trick is producing the per-level grouping. At the start of each level, snapshot the queue size — that is exactly how many nodes belong to this level. Pop that many, push their children, and append the popped values to the level list. The common bug is using a single while loop without the size snapshot, which mixes levels together. Time is O(n), space is O(w) where w is the maximum width of the tree. If the interviewer asks for the variant where odd levels are reversed (zigzag traversal), the cleanest mutation is a boolean flag and reversing the level list before appending — do not try to push to a deque from both ends, the bookkeeping gets ugly under pressure.
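The size-snapshot pattern in Python:

```python
from collections import deque

class TreeNode:
    def __init__(self, val, left=None, right=None):
        self.val, self.left, self.right = val, left, right

def level_order(root):
    """BFS that groups values per level via the queue-size snapshot."""
    if not root:
        return []
    levels, q = [], deque([root])
    while q:
        size = len(q)                 # exactly the nodes on this level
        level = []
        for _ in range(size):
            node = q.popleft()
            level.append(node.val)
            if node.left:
                q.append(node.left)
            if node.right:
                q.append(node.right)
        levels.append(level)
    return levels
```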

Q. Implement a function that detects whether a singly linked list contains a cycle, and if so, returns the node where the cycle begins.

Floyd's tortoise and hare. Move slow by one and fast by two until they either meet (cycle exists) or fast hits null (no cycle). To find the entry node, reset one pointer to the head after they meet, then advance both by one — they meet again at the cycle entry. The proof is worth knowing because Microsoft interviewers sometimes ask for it: if the non-cycle prefix has length a, the cycle has length c, and the pointers meet b steps into the cycle, then slow has walked a + b while fast has walked 2(a + b), so a + b must be a multiple of c and a ≡ c − b (mod c). In other words, the head-to-entry distance equals the meeting-point-to-entry distance, which is exactly why advancing both pointers by one lands them at the entry together. O(n) time, O(1) space. The hash-set version is O(n) space and is acceptable as a starter, but the follow-up will always be 'can you do it in constant space?'
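Both phases in one short Python function:

```python
class ListNode:
    def __init__(self, val, nxt=None):
        self.val = val
        self.next = nxt

def detect_cycle(head):
    """Floyd's tortoise and hare: returns the cycle-entry node, or None."""
    slow = fast = head
    while fast and fast.next:
        slow = slow.next
        fast = fast.next.next
        if slow is fast:            # phase 1: meeting point inside the cycle
            slow = head             # phase 2: reset one pointer to the head
            while slow is not fast:
                slow = slow.next
                fast = fast.next
            return slow             # both now stand on the cycle entry
    return None
```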

Q. Given two sorted arrays, merge them into one sorted array in place — assume the first array has enough trailing space to hold both.

Walk both arrays from the back, not the front. Two pointers at the last real element of each array, plus a write pointer at the end of the destination. At each step, copy the larger of the two values into the write slot and decrement the corresponding pointer. When the second array's pointer goes negative you are done — any remaining elements in the first array are already in place. The forward-walking version is the classic mistake: it overwrites unread values in the first array unless you copy the first array off to a temp buffer, which costs O(m) extra space. The reverse walk costs O(1). This shows up at Microsoft because it tests whether you instinctively look for the in-place trick.
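The reverse walk in Python (LeetCode-style signature, shown for illustration):

```python
def merge_in_place(nums1, m, nums2, n):
    """nums1 has length m + n; its last n slots are spare space."""
    i, j, write = m - 1, n - 1, m + n - 1
    while j >= 0:   # once nums2 is exhausted, nums1's leftovers are in place
        if i >= 0 and nums1[i] > nums2[j]:
            nums1[write] = nums1[i]
            i -= 1
        else:
            nums1[write] = nums2[j]
            j -= 1
        write -= 1
```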

Q. Given an array of integers, find all triplets that sum to zero, with no duplicate triplets.

Sort first — sorting is what makes deduplication tractable. Then for each index i, run a two-pointer pass on the subarray to its right looking for a pair that sums to negative nums[i]. Skip i forward whenever nums[i] equals nums[i-1], and inside the two-pointer loop skip the left and right pointers past duplicates after each successful triplet. The full complexity is O(n^2) time, O(1) extra space ignoring the output. The naive triple-loop is O(n^3) and will not pass. The hash-set per-element version is also O(n^2) but uses O(n) extra space and is harder to deduplicate cleanly. Microsoft interviewers care about the deduplication detail — saying 'sort plus two pointers' is half the answer; explaining the three skip conditions is the other half.
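A sketch showing all three skip conditions:

```python
def three_sum(nums):
    """Sort + two pointers: O(n^2) time, no duplicate triplets."""
    nums.sort()                                # sorting makes dedup tractable
    res = []
    for i in range(len(nums) - 2):
        if i > 0 and nums[i] == nums[i - 1]:   # skip 1: duplicate anchors
            continue
        lo, hi = i + 1, len(nums) - 1
        while lo < hi:
            s = nums[i] + nums[lo] + nums[hi]
            if s < 0:
                lo += 1
            elif s > 0:
                hi -= 1
            else:
                res.append([nums[i], nums[lo], nums[hi]])
                while lo < hi and nums[lo] == nums[lo + 1]:  # skip 2: dup lows
                    lo += 1
                while lo < hi and nums[hi] == nums[hi - 1]:  # skip 3: dup highs
                    hi -= 1
                lo += 1
                hi -= 1
    return res
```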

Q. Design and implement an LRU cache with O(1) get and put.

Hash map plus doubly linked list. The hash map keys to list nodes; the list orders nodes by recency, with the head as most recent and the tail as least recent. On get, look up the node, splice it out of its current position, and move it to the head. On put, if the key exists, update the value and move to head; if it does not, create a new node at the head and, if size exceeds capacity, evict the tail and remove its key from the map. The reason a doubly linked list is non-negotiable is the splice operation — singly linked lists cannot remove a known node in O(1) without holding the previous pointer. Use a dummy head and dummy tail to avoid null checks in the splice helper. This question is a Microsoft staple at level 62 and above because it forces you to combine two data structures correctly and handle the eviction edge case where you remove the same key you are inserting.
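A hand-rolled sketch with the dummy head and tail (Python has `OrderedDict`, but interviewers usually want the structure built explicitly):

```python
class LRUCache:
    """Hash map + doubly linked list; the head side is most-recent."""

    class _Node:
        __slots__ = ("key", "val", "prev", "next")
        def __init__(self, key=None, val=None):
            self.key, self.val = key, val
            self.prev = self.next = None

    def __init__(self, capacity):
        self.cap = capacity
        self.map = {}
        # Dummy head/tail remove every null check from the splice helpers.
        self.head, self.tail = self._Node(), self._Node()
        self.head.next, self.tail.prev = self.tail, self.head

    def _unlink(self, node):
        node.prev.next, node.next.prev = node.next, node.prev

    def _push_front(self, node):
        node.prev, node.next = self.head, self.head.next
        self.head.next.prev = node
        self.head.next = node

    def get(self, key):
        if key not in self.map:
            return -1
        node = self.map[key]
        self._unlink(node)        # splice out of its current position...
        self._push_front(node)    # ...and mark as most recently used
        return node.val

    def put(self, key, val):
        if key in self.map:       # existing key: update, no eviction needed
            node = self.map[key]
            node.val = val
            self._unlink(node)
            self._push_front(node)
            return
        if len(self.map) >= self.cap:
            lru = self.tail.prev  # evict the least-recent node at the tail
            self._unlink(lru)
            del self.map[lru.key]
        node = self._Node(key, val)
        self.map[key] = node
        self._push_front(node)
```

Handling the key-already-exists case before the eviction check is what avoids the edge case of evicting the key you are inserting.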

Mixed coding + behavioral — the Microsoft hybrid format

These are the questions that look behavioral on the surface (“how would you?”) but turn into a small technical design conversation under follow-up pressure. They show up in the mixed rounds and let one interviewer probe two signals at once.

Q. How would you design a calendar application — and then, given the constraints we discuss, how would you store recurring events?

Start with the user-visible features so the interviewer knows you have a model in your head: events with title, start, end, attendees, location, recurrence, and reminders; views by day, week, and month; conflict detection. Then narrow when they push you toward storage. The recurrence problem is the meat — you do not store every instance of a daily standup for the next ten years. You store one base event with a recurrence rule (RRULE in iCalendar terms — frequency, interval, count or until, by-day, by-month-day) and an exceptions table for cancellations and edits to specific instances. A query for 'show me this week' expands the rule on read inside the requested window. The follow-up Microsoft tends to ask is 'what if a user edits one instance' — that becomes a row in the exceptions table that overrides the generated instance. The hybrid framing is the point: the interviewer is checking whether you can take a fuzzy product question, pin down a concrete schema, and reason about a real edit-flow under follow-up pressure.
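A deliberately tiny Python sketch of expand-on-read for a daily rule with an exceptions set — real calendars implement full iCalendar RRULE semantics (by-day, count, until, and so on), and the names here are illustrative:

```python
from datetime import date, timedelta

def expand_daily(base_start, interval, window_start, window_end, exceptions):
    """Expand a daily recurrence rule on read, inside the requested window.

    exceptions: set of dates whose instances were cancelled or overridden.
    """
    out = []
    if window_start > base_start:
        # Jump straight to the first occurrence at or after the window start.
        elapsed = (window_start - base_start).days
        skip = -(-elapsed // interval)          # ceiling division
        current = base_start + timedelta(days=skip * interval)
    else:
        current = base_start
    while current <= window_end:
        if current not in exceptions:
            out.append(current)
        current += timedelta(days=interval)
    return out
```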

Q. How would you implement a function to validate that a Sudoku board is currently legal — and then, walk me through how you would extend it to a full solver?

Validation is three sets of nine constraints: each row has unique digits one through nine, each column has unique digits, each of the nine three-by-three sub-boxes has unique digits. Walk the eighty-one cells once, hashing into nine row-sets, nine column-sets, and nine box-sets keyed by floor(row/3)*3 + floor(col/3). Any duplicate insertion fails the board. O(1) time and space because the size is fixed. For the solver extension, it becomes backtracking: find the first empty cell, try digits one through nine, check the three constraints incrementally (do not re-validate the whole board each time — keep the three sets live and update them on push and pop), and recurse. If no digit works, backtrack. The interviewer is watching whether you reuse the validation data structures inside the solver instead of rebuilding them — that is the 'as appropriate' insight Microsoft loops weight heavily.
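The validation pass in Python ('.' marks an empty cell, the usual convention):

```python
def is_valid_sudoku(board):
    """board: 9x9 list of lists of '1'..'9' or '.'; one pass, three set families."""
    rows = [set() for _ in range(9)]
    cols = [set() for _ in range(9)]
    boxes = [set() for _ in range(9)]
    for r in range(9):
        for c in range(9):
            d = board[r][c]
            if d == '.':
                continue
            b = (r // 3) * 3 + c // 3       # index of the 3x3 sub-box
            if d in rows[r] or d in cols[c] or d in boxes[b]:
                return False                # duplicate violates a constraint
            rows[r].add(d)
            cols[c].add(d)
            boxes[b].add(d)
    return True
```

The solver extension keeps these same three set families live, adding a digit on each recursive push and removing it on backtrack instead of rebuilding them.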

Q. How would you build a feature that auto-saves a document every few seconds without losing the user's work if the network drops?

Two layers. Layer one is local: every change writes to an in-memory buffer with a sequence number, and a debounced flush (say every two seconds of inactivity, or every five seconds maximum) persists the buffer to local storage — IndexedDB on the web, the platform's preference store on desktop. The user never loses work to a tab crash. Layer two is sync: a background worker reads from local storage and POSTs deltas, not full snapshots, to the server. Each delta carries the sequence number; the server acknowledges up to sequence N, and the client trims its local queue. On reconnect, the queue replays in order. The conflict question is the standard follow-up: who wins when two devices edit offline? The pragmatic answer for a single-user document is last-writer-wins on the server clock; for collaborative editing you need operational transforms or CRDTs. Microsoft will probe on the trade-off — name it.
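A hypothetical client-side sketch of the sequence-number/ack-trim queue (class and method names are invented for illustration):

```python
from collections import deque

class DeltaQueue:
    """Sequence-numbered deltas awaiting server acknowledgment."""

    def __init__(self):
        self.seq = 0
        self.pending = deque()       # (seq, delta) pairs not yet acked

    def record(self, delta):
        """Local edit happened: assign it the next sequence number."""
        self.seq += 1
        self.pending.append((self.seq, delta))
        return self.seq

    def acknowledge(self, acked_seq):
        """Server confirmed everything up to acked_seq: trim the queue."""
        while self.pending and self.pending[0][0] <= acked_seq:
            self.pending.popleft()

    def replay(self):
        """On reconnect, resend every unacknowledged delta in order."""
        return [delta for _, delta in self.pending]
```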

Q. How would you handle rate limiting for an API endpoint — and how does your answer change if the endpoint is hit by a customer's ten-thousand-machine fleet versus a single browser?

Single-browser case: token bucket per user identifier with the bucket sitting in an in-memory store like Redis. A request decrements; if the count goes negative, return 429 with a Retry-After header. Refill at a fixed rate. The fleet case forces a different decomposition. Per-user limits do not help because all ten thousand machines share the same user. You move the limit to the API key plus origin IP, and you accept that strict global synchronization is too slow at high QPS. The realistic implementation is local in-process counters synced asynchronously to a central store, with a small over-shoot tolerance — the alternative is taking a Redis round-trip on every request, which adds latency on the hot path. The Microsoft hybrid framing is to start with the user-visible behavior (what does the customer see when throttled?) and let the implementation fall out of the constraints. They are listening for whether you trade strict correctness for latency at the right point.
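The single-key token-bucket logic, sketched in-process (a real deployment would hold the state in Redis and this is just the refill/decrement arithmetic):

```python
import time

class TokenBucket:
    """`rate` tokens per second, bursting up to `capacity`."""

    def __init__(self, rate, capacity):
        self.rate = rate
        self.capacity = capacity
        self.tokens = capacity
        self.last = time.monotonic()

    def allow(self):
        now = time.monotonic()
        # Refill proportionally to elapsed time, capped at capacity.
        self.tokens = min(self.capacity,
                          self.tokens + (now - self.last) * self.rate)
        self.last = now
        if self.tokens >= 1:
            self.tokens -= 1
            return True
        return False    # caller returns 429 with a Retry-After header
```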

Q. How would you design and implement an undo-redo stack for a text editor — and what changes when the editor is collaborative?

Single-user version is two stacks of operations. Every user action pushes onto the undo stack and clears the redo stack. Undo pops from undo, applies the inverse, pushes onto redo. Redo pops from redo, reapplies, pushes back onto undo. Operations carry enough state to invert themselves — a delete carries the deleted text and the position. The collaborative version breaks the simple stack model because remote operations can land between your local actions. You cannot blindly undo your last action if a colleague has since edited the same paragraph. The fix is operational transformation: each user has their own undo stack of operations, and undoing means computing the inverse and transforming it against any concurrent operations that have arrived since. The interviewer is watching whether you recognize that the data structure does not change but the semantics do, and whether you can articulate why naive stack-pop is wrong in the multi-user case.
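The single-user two-stack model, with a sample operation that carries enough state to invert itself (names are illustrative):

```python
class UndoRedoStack:
    """Two stacks of invertible operations over a mutable document."""

    def __init__(self, document):
        self.doc = document
        self.undo_stack = []
        self.redo_stack = []

    def do(self, op):
        op.apply(self.doc)
        self.undo_stack.append(op)
        self.redo_stack.clear()       # a new action invalidates redo history

    def undo(self):
        if self.undo_stack:
            op = self.undo_stack.pop()
            op.invert(self.doc)
            self.redo_stack.append(op)

    def redo(self):
        if self.redo_stack:
            op = self.redo_stack.pop()
            op.apply(self.doc)
            self.undo_stack.append(op)

class Insert:
    """Insertion op: position + text is enough to invert it."""

    def __init__(self, pos, text):
        self.pos, self.text = pos, text

    def apply(self, doc):
        doc[self.pos:self.pos] = list(self.text)

    def invert(self, doc):
        del doc[self.pos:self.pos + len(self.text)]
```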

System design — Microsoft-flavored problems

System design enters the loop at level 61 and is decisive at 62 and 63. Microsoft prompts often map to products the company actually ships — the interviewer has a deep mental model of the real architecture and will probe the parts where naive answers collapse.

Q. Design Outlook — the email client and server, end to end.

Frame the problem first: hundreds of millions of mailboxes, billions of messages per day, sub-second send and receive, full-text search, calendar integration, mobile and desktop clients, offline support. Then partition. The mailbox storage layer is sharded by user ID — each user's mailbox is a contiguous append-only log of message envelopes plus a separate blob store for attachments. The send path goes through SMTP with a queue (one per outbound shard) and DKIM/SPF signing on the way out. The receive path lands in a per-user folder, runs spam and malware classifiers, then writes to the mailbox log and updates a search index built per-user (because search is always scoped to one mailbox). The sync protocol to clients is delta-based: clients keep a sync token, ask the server 'what changed since this token,' and apply the delta. Calendar shares the storage substrate but has its own recurrence-rule expansion logic. The places to dig in if the interviewer pushes — and they will — are: how you handle a user whose mailbox is too big to fit on one shard (split into folders mapped to sub-shards), how attachments get deduplicated across recipients (content-hash addressing in the blob store), and how the search index stays consistent (eventual consistency with a per-user lag budget of a few seconds is acceptable). Microsoft cares about the storage model and the sync protocol — those are the levers.

Q. Design OneDrive sync — the file-system replication that keeps a folder consistent across multiple devices.

The hard part is not 'upload files to a server.' The hard part is conflict detection, partial syncs over flaky networks, and bandwidth efficiency. Each file gets a content hash and a per-file version vector. Clients run a local watcher (FSEvents on Mac, ReadDirectoryChangesW on Windows) that produces a stream of change events. A change is enqueued with the file's hash and a parent version vector. The server accepts changes that fast-forward the version vector and rejects ones that conflict. Conflicts get materialized as a duplicate file with a suffix — '(conflict copy from device X)' — and surfaced in the UI. Bandwidth efficiency means chunking files into fixed or content-defined blocks, hashing each block, and only uploading blocks the server does not already have — this is what makes a one-byte edit in a one-gigabyte file fast. The download path is symmetric: the client asks 'what blocks do I need for this version,' fetches missing blocks, and reassembles. Selective sync is folder-level subscription: the client tells the server 'I only care about these subtrees,' and the server only streams change events for those. The Microsoft-flavored follow-up is almost always around the watcher: what happens when the user is offline for two weeks, the watcher queue is huge, and the server has also moved forward? The answer is a full reconciliation pass — walk the local tree, compute hashes, diff against server state, push and pull as needed.
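A toy sketch of the block-level dedup, assuming fixed-size chunks — the demo uses a 4-byte block so the example stays readable, while production systems use megabyte-scale blocks and often content-defined chunking, which also avoids the block-shift problem a fixed grid has when bytes are inserted:

```python
import hashlib

CHUNK_SIZE = 4   # tiny for the demo; real block sizes are megabyte-scale

def chunk_hashes(data):
    """Split a file into fixed-size blocks and hash each block."""
    return [hashlib.sha256(data[i:i + CHUNK_SIZE]).hexdigest()
            for i in range(0, len(data), CHUNK_SIZE)]

def blocks_to_upload(local, server_hashes):
    """Indices of local blocks the server does not already have."""
    return [i for i, h in enumerate(chunk_hashes(local))
            if h not in server_hashes]
```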

Q. Design Teams chat — one-to-one and group messaging at large scale, with presence and typing indicators.

Three subsystems. First, message storage: a chat is a partitioned log keyed by chat-ID, with messages appended in order. For one-to-one, the chat-ID is a deterministic hash of the two user IDs sorted; for groups, it is a UUID. Reads are tail-driven — clients fetch the last N messages and subscribe to new ones. Second, the connection layer: persistent WebSocket or long-poll connections from clients to a fleet of edge servers. Each user is mapped to one edge server at a time, looked up via a presence service. When user A sends to user B, the message lands in the chat log, then a fanout worker looks up B's edge server and pushes the message down B's socket. If B is offline, the push is skipped — the next time B connects, the client reads the unread tail from the log. Third, presence and typing: presence is a TTL'd entry in a fast key-value store, refreshed on heartbeat; typing indicators are ephemeral fanouts that do not touch the message log at all because they are not durable. The two scale problems Microsoft will probe are large groups (a 10,000-person Teams channel) — for which you batch-fanout via a per-channel subscription tree rather than naively pushing to 10,000 sockets one at a time — and read-receipts, which need their own append-only structure keyed by (chat-ID, user-ID) to avoid hot-spotting on the message rows.

Behavioral — Growth Mindset and customer obsession

The Microsoft behavioral rubric weighs Growth Mindset, customer obsession, and partner-collaboration heavily. Answers that read as polished hero stories tend to score below answers that show real self-awareness, a concrete behavior change, and a measurable outcome.

Q. Tell me about a time you received feedback that was hard to hear. What did you do with it?

This is a Growth Mindset probe — the most heavily weighted Microsoft cultural attribute. The pattern that lands is: state the feedback specifically (vague feedback in vague answers reads as evasive), describe your initial reaction honestly including any defensiveness, narrate the concrete behavior change you made, and close with the second-order outcome — 'two quarters later, my next review explicitly called out that I had improved on this.' The trap is reciting a hero arc where you immediately accepted the feedback gracefully — interviewers know that is rarely true and read it as polished rather than authentic. The 'learn it all, not know it all' framing is the explicit Microsoft language; you do not have to use that exact phrase, but the answer should embody it. Tie the change back to a measurable improvement.

Q. Describe a time you disagreed with a senior engineer or your manager. How did you handle it?

Microsoft is looking for collaborative disagreement, not heroic dissent. The structure: the disagreement was technical and specific (not a personality clash), you raised it in the appropriate forum (one-on-one or design review, not a group Slack callout), you brought data or a working prototype to the conversation, and you were genuinely open to being wrong. The closer matters most: either you persuaded them and the decision changed, or they persuaded you and you publicly supported the chosen direction. Both endings are fine. The losing answer is one where you 'were right all along' and the project failed because nobody listened — that reads as a teammate problem, which is a hire-no flag. Microsoft is partner-heavy as a culture; they want to see that you can disagree, commit, and move forward without resentment.

Q. Tell me about a project that failed. What did you learn?

Pick a real failure with a real cost — not 'we shipped a week late.' A scoped project that got cancelled, a launch that regressed a metric, an architectural choice that had to be reversed. Walk through what you missed and why you missed it (premise was wrong, stakeholders were not aligned, scope grew past your budget, you over-estimated team velocity). Then, and this is the part candidates skip, describe the structural change you made afterward — a new pre-mortem checklist, a different cadence with the PM, a habit of writing one-page design docs before any work starts. Microsoft's customer obsession value sits behind this question: did the failure teach you something about what users actually needed, versus what your team thought they needed? If yes, name it. The answer is not 'I learned to work harder' — that signals you have not actually reflected.

Q. Give an example of when you had to learn a new technology or domain quickly to ship something.

This is a direct Growth Mindset signal. Pick something where the learning curve was real — a new programming language for a critical-path service, an unfamiliar domain like cryptography or video codecs, an internal Microsoft system you had never touched (Cosmos, Substrate, whatever). Show the learning strategy: who you talked to, what you read, what you built as a learning artifact, how long the ramp took, and what the shipped outcome was. Bonus points for naming the moment you realized you were going to make the deadline — interviewers like specificity that suggests the story is real. The losing version is 'I read the docs and figured it out' — that is what every candidate says. Real answers have a person who helped, a sticking point that was non-obvious, and a shipped thing at the end.

Q. Tell me about a time you used customer feedback or data to change a product or technical decision.

Customer obsession is a Microsoft cultural pillar and shows up in almost every behavioral loop, especially at level 62 and above. The story should go beyond 'we ran a survey and added the top feature.' The strongest answers describe a moment where the customer signal contradicted the team's prior belief — a feature you assumed was loved had low engagement, a bug nobody prioritized was the top support driver, a workflow your PM was sure mattered turned out to be a niche. Then describe the technical or product change: scoping work, deprecating a feature, rebuilding a flow. Close with the metric that moved. The framing the loop wants to hear is that you treat the customer as a real source of truth — not as a vague justification you reach for after the fact.

What separates a hire from a no-hire across the loop

  • Stating complexity before being asked. At Microsoft this is expected on every coding answer — silence reads as not having checked.
  • Narrating trade-offs in the hybrid round. Saying “here are two approaches and here is why I am picking this one for these constraints” is the single move that separates a strong hybrid answer from a generic one.
  • Behavioral specificity. Names, numbers, and dates. A story without a number does not get scored as a story; it gets scored as a position.
  • Active calibration during system design. Microsoft interviewers expect you to ask scoping questions early — daily active users, read-write ratio, latency budget — and to revisit those numbers when you make a partitioning choice.

Practice these on a live Microsoft loop, with PhantomCode

PhantomCode is an interview copilot that listens to your live interview audio and surfaces structured guidance — code outlines for the coding round, prompts for the behavioral STAR structure, and scaffolds for the hybrid “how would you” questions Microsoft asks. It is invisible to the interviewer and works on Teams, Zoom, Meet, and on-site phones.

Try the interview copilot · See other companies

Last reviewed for the 2026 Microsoft interview cycle. Question patterns reflect SDE loops across Azure, M365, Xbox, and Bing org branches and are derived from candidate-reported experience, not from internal Microsoft material.