Question 1

Implement a circular buffer (ring buffer) suitable for a real-time market-data pipeline. Must be lock-free for the single-producer single-consumer case.

Accepted Answer

Walk through the SPSC ring buffer with atomic read/write indices. The trick is the memory ordering — release-store on write, acquire-load on read — and the wraparound handling (size must be a power of 2 to use bitmask instead of modulo). The producer reads the consumer index with acquire to check there is room; it writes the slot, then publishes the new write index with release so the consumer cannot observe the index before the data lands. The consumer mirrors that — acquire-load the write index, read the slot, then release-store the bumped read index. Discuss the L5 follow-ups: multi-producer (need CAS on the write index, with a separate ‘published’ marker per slot so consumers know when a reservation has actually been filled), false sharing (cache-line pad the producer and consumer indices to 64 bytes each — without this, the two cores ping-pong the line on every operation and throughput collapses), and the bounded vs unbounded variants (unbounded is rarely what you want in a feed handler — drop or coalesce is almost always the correct policy under back-pressure). Mention disruptor pattern (LMAX) as the reference for high-performance ring buffers and the canonical thing to cite if the interviewer wants to go deeper.

Question 2

Given a stream of stock price ticks, implement a function that returns the maximum price in the last K ticks for any query, in O(1) amortized.

Accepted Answer

Sliding-window-maximum with a monotonic deque. The deque holds indices of candidates in decreasing order of value. On each new tick: pop from the back while the back element is smaller than the new tick (they can never be the max while the new one is in the window), push the new index, pop from the front if the front index is now outside the window. Max is always at the front. Walk through with an example — say ticks [3, 1, 4, 1, 5, 9, 2, 6] with K=3 — and trace the deque state at each step so the interviewer sees you actually believe the invariant. Each index enters and leaves the deque at most once, so total work is O(n) over n ticks, which gives O(1) amortised per tick. The L5+ follow-up: extend to query the max over arbitrary ranges (not just the last K), which is sparse tables (O(n log n) build, O(1) query, immutable) or segment trees (O(n) build, O(log n) query and update, mutable). Pick segment tree if the tick history is being updated as corrections arrive, sparse table if it is read-only history.

Question 3

Given a list of stock trades (timestamp, symbol, price, quantity), compute the VWAP (Volume-Weighted Average Price) per symbol in a streaming fashion. Then handle the sliding-window version (rolling 5-minute VWAP).

Accepted Answer

VWAP = sum(price × quantity) / sum(quantity). Streaming: maintain running sums per symbol in a hash map — on every trade, accumulate notional (price × quantity) and quantity, divide on read. O(1) per trade, O(S) memory where S is symbol count. Sliding window: a deque of (timestamp, price × quantity, quantity) per symbol, expire the front when its timestamp falls outside the window, recompute sums incrementally (subtract on expiry, add on arrival — do not re-sum the whole deque). Discuss memory: you are keeping every trade in the window, and for very high-volume symbols (think SPY, which can do hundreds of thousands of trades per minute) this can be large; sampling or aggregation per second is a common compromise — store one bucket per second with summed notional and quantity, and you cap memory at 300 buckets per symbol for a 5-minute window. The L5 question: how does this interact with corporate actions (splits, dividends) that retroactively adjust prices? Historical VWAPs computed before a 2-for-1 split halve overnight; you either store the raw trade and reapply adjustments on read, or store an adjustment-versioned VWAP and accept that re-runs change the number.

Question 4

Implement a function that detects when a stock price has ‘broken out’ — defined as a sustained move above the rolling 50-period max for the last N periods. Discuss the trade-offs of different definitions of ‘sustained’.

Accepted Answer

Walk through the rolling-max computation (use a monotonic deque as above), the ‘sustained’ check (consecutive periods above the rolling max — but how many periods? what threshold above? 0.1% over, 1% over, just strictly greater?), and the latency-vs-false-positive trade-off (longer confirmation = fewer false signals but slower entry; tighter threshold = more signals but more whipsaw). The implementation itself is short — for each new bar, update the rolling-max deque, check whether the current close exceeds the rolling max by the configured threshold, increment a consecutive-counter if so and reset it otherwise, emit a breakout signal when the counter hits N. The senior signal is recognizing that the definition matters more than the implementation; ask the interviewer to nail down the spec before writing code. Concretely: is the 50-period window inclusive of the current bar (then breakout is impossible by definition) or strictly prior? Does ‘above’ mean close above, or any tick above? Does the signal reset if the price falls back below mid-confirmation? These are the questions a Bloomberg engineer would expect you to surface, because every one of them is a real production-decision the team has had to make.

Question 5

Design Bloomberg’s market data distribution system — real-time price ticks from every major exchange must reach 300k terminals globally with sub-100ms latency.

Accepted Answer

Cover the pipeline end-to-end: the feed handlers at each exchange (custom C++ binaries co-located in the exchange’s data center, parsing exchange-specific protocols like FIX, FAST, ITCH for NASDAQ, PITCH for BATS — each protocol has its own quirks and the handlers are a major piece of internal IP), the normalization layer (proprietary internal format with a unified schema so downstream consumers see a single tick representation regardless of source), the distribution fabric (multicast within a data center because one-to-many is the natural pattern for market data, then private fiber WAN between DCs because you cannot trust the public internet for sub-100ms cross-continent delivery, then unicast TCP or a proprietary protocol to terminals at the edge), and the back-pressure handling (you cannot slow down the exchange; you must drop or aggregate ticks if a downstream is slow — common policies are conflation, where only the latest quote per symbol is kept, or sequence-gap recovery, where the consumer requests a snapshot to catch up). Discuss latency budgets at each hop — typically single-digit microseconds inside the exchange DC, single-digit milliseconds across the WAN, single-digit milliseconds for the last mile — and the tools used to measure them (hardware timestamping on the NIC, distributed traces, PTP for clock sync because NTP does not have enough resolution; PTP at minimum, hardware PTP on the NIC for the latency-critical paths).

Question 6

Design a time-series database for storing 50 years of historical market data — billions of (symbol, timestamp, OHLCV) tuples — with sub-second queries for arbitrary symbol + date range.

Accepted Answer

Cover: the storage format (columnar, with per-symbol files or partitions — Parquet or a custom columnar format; columnar wins because the dominant access pattern is ‘give me all OHLCV for AAPL from 2010 to 2020’, which touches a few columns over a large row range), the time partitioning (one file per symbol per month is a common balance — fine enough that a 1-day query reads one file, coarse enough that 50 years of one symbol is only 600 files), the compression (delta encoding on timestamps because consecutive timestamps differ by tiny amounts, dictionary encoding on symbols and exchange codes, run-length on sparse fields like halted flags), and the query layer (parallelized across partitions with a query planner that prunes irrelevant files using metadata footers — min/max timestamp, symbol set — before opening them). Discuss the harder problems: corporate actions (back-adjusting historical prices for splits and dividends — store the raw prices and apply an adjustment factor at read time, with the factor itself versioned by the date the action was announced), data quality (exchanges occasionally publish corrections — you must version a tuple, typically with a (symbol, timestamp, version) primary key and a read-time ‘give me latest version per (symbol, timestamp)’ pass), and the L5+ extension: tick-by-tick storage (billions of trades per day across thousands of symbols means you need a fundamentally different layer — append-only logs partitioned by exchange and hour, with bloom filters for symbol lookups, because the OHLCV partitioning scheme does not survive at that cardinality).

Question 7

Design Bloomberg Chat — a real-time messaging system for financial professionals, must support compliance archival (every message kept for 7 years), end-to-end encryption, and trader-specific features like inline market data.

Accepted Answer

Real-time messaging fabric (WebSocket connections to a sharded routing service, sharded by user_id so all of a user’s sessions land on the same shard and you can fan out a single delivery to all their devices cheaply), compliance pipeline (every message also written to an immutable archive — typically a write-once-read-many store like S3 with object lock, indexed by (firm_id, date, message_id) for the periodic compliance review workflow), end-to-end encryption that is compatible with compliance review (compliance officers can search archived messages — usually a ‘managed key’ model where the firm holds keys in an HSM, not the individual user; the message is encrypted with a per-conversation key, the key is wrapped to both the participants and the firm’s compliance key). Discuss the harder problems: cross-firm messaging (one firm’s compliance rules vs another’s — the message has to be archived twice, once per firm, and the encryption has to satisfy both firms’ key management policies), spam and harassment detection at scale (off-platform classifiers cannot see plaintext, so detection runs client-side or in a privacy-preserving server-side enclave), and the integration with terminal features (clicking a ticker symbol in a chat opens the chart — the chat client embeds a rich-message protocol that the terminal understands, with the chat backend treating these as opaque attachments).

Question 8

Tell me about a time you spent days hunting a bug.

Accepted Answer

Bloomberg’s culture values craft and rigor. Strong answers describe a deep technical investigation — often a race condition, memory corruption, or networking bug — and the systematic process you used. The shape: the bug’s symptoms (what was observed, by whom, how often, what made it worth multiple days of attention), the dead ends (the hypotheses you ruled out and how — this is the part most candidates skip, but it is the most informative because it shows your debugging discipline), the breakthrough insight (often a small observation that reframed the search — a timestamp that did not line up, a packet capture that showed an unexpected retransmit), and what tooling or habit you adopted afterward to catch this category of bug sooner (a new lint rule, a stress test, an assertion in production). The senior signal is naming the bug category precisely (e.g., ‘a TOCTOU race in the file-system watcher’, ‘a use-after-free triggered by a double-callback in an asynchronous I/O path’, ‘a torn read on a 64-bit value on a 32-bit ARM platform’ — that level of specificity shows pattern recognition and that you actually understood the bug rather than just stumbling onto the fix).

Question 9

Describe a project where you optimized for latency. What did you measure, and how did you measure it?

Accepted Answer

Bloomberg’s product is fundamentally a latency game. Strong answers walk through: the latency budget (target p50 / p99 / p99.9 — never just an average, because tail latency is the whole story in a real-time system), the measurement infrastructure (hardware timestamping on the NIC for wire-to-wire measurements, distributed traces for cross-service paths, eBPF probes for kernel-level events like scheduler delays and syscall costs, perf for CPU-level hotspots), the bottlenecks identified (often surprising — GC pauses on the JVM heap, kernel context switches when the work crosses a thread, NIC interrupt coalescing adding milliseconds of jitter, lock contention on a supposedly hot-path-free data structure), and the wins (specific numbers — ‘we cut p99 from 18ms to 4ms by pinning the feed-handler thread and disabling interrupt coalescing on that NIC’). Avoid claiming you optimized ‘everything’ — that reads as either dishonest or unfocused; show that you made calibrated choices, that you measured first and changed second, and that you knew when to stop (the latency budget was met, further optimisation had diminishing returns).

Question 10

Tell me about a time you mentored a junior engineer through a hard problem.

Accepted Answer

Bloomberg values craft transmission. Strong answers describe a specific engineer with a specific problem, the journey you walked them through (often a longer journey than you would have taken yourself, because the goal was their learning, not just the fix — you let them write the wrong abstraction first, asked the question that would expose the flaw, and let them feel the rewrite rather than handing them the right answer), and the moment they got it (the visible click — when they started spotting the same pattern elsewhere in the code without prompting). Reflect on what you learned about teaching — most senior engineers underestimate how much context is missing for a junior; the senior treats a piece of knowledge as obvious because they have used it for years, the junior has never seen it, and bridging that gap requires actually inventorying what you take for granted. A grounded answer names a habit you developed (asking ‘what is your mental model of X here?’ before correcting, writing things on a whiteboard rather than typing them, sending the link to the doc instead of summarising it so they build the reading habit).

Bloomberg Interview Questions — What the SDE Loop Actually Asks

How Bloomberg interviews are structured

Coding round Q&A

Q1. Implement a circular buffer (ring buffer) suitable for a real-time market-data pipeline. Must be lock-free for the single-producer single-consumer case.

Q2. Given a stream of stock price ticks, implement a function that returns the maximum price in the last K ticks for any query, in O(1) amortized.

Q3. Given a list of stock trades (timestamp, symbol, price, quantity), compute the VWAP (Volume-Weighted Average Price) per symbol in a streaming fashion. Then handle the sliding-window version (rolling 5-minute VWAP).

Q4. Implement a function that detects when a stock price has ‘broken out’ — defined as a sustained move above the rolling 50-period max for the last N periods. Discuss the trade-offs of different definitions of ‘sustained’.

System design Q&A

Q1. Design Bloomberg’s market data distribution system — real-time price ticks from every major exchange must reach 300k terminals globally with sub-100ms latency.

Q2. Design a time-series database for storing 50 years of historical market data — billions of (symbol, timestamp, OHLCV) tuples — with sub-second queries for arbitrary symbol + date range.

Q3. Design Bloomberg Chat — a real-time messaging system for financial professionals, must support compliance archival (every message kept for 7 years), end-to-end encryption, and trader-specific features like inline market data.

Behavioural Q&A

Q1. Tell me about a time you spent days hunting a bug.

Q2. Describe a project where you optimized for latency. What did you measure, and how did you measure it?

Q3. Tell me about a time you mentored a junior engineer through a hard problem.

During the actual loop