Why can someone watching my encrypted LLM traffic still infer what I asked?

Opportunity

Whisper Leak, disclosed in late 2025, demonstrated that analyzing packet timing and size patterns in encrypted streaming LLM responses can classify prompt topics with greater than 98% precision across 28 major providers. Some providers, including OpenAI and Mistral, deployed fixes, but those mitigations address token-length patterns only. A separate attack exploits speculative decoding: the number of tokens accepted per decoding step varies with output content, and that signal leaks even through padded connections because padding does not remove the acceptance-rate fluctuation. Proposed defenses such as token batching reduce attack accuracy by 50% but do not eliminate it, and random padding imposes up to 8.7x payload overhead while still leaving residual leakage. No provider has shipped a complete mitigation for the speculative decoding variant.
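To make the size side channel concrete, here is a minimal sketch of what an on-path observer can measure when each streamed chunk travels in its own encrypted record, and how token batching and fixed-block padding change that view. It is an illustration under stated assumptions, not a reproduction of the Whisper Leak pipeline: the function names, the example tokens, and the 22-byte per-record overhead are all hypothetical choices, and timing and speculative-decoding effects are not modeled.

```python
from typing import List

# Assumed fixed per-record overhead in bytes (illustrative only; the real
# value depends on the protocol version and cipher suite in use).
RECORD_OVERHEAD = 22


def record_sizes(chunks: List[str], overhead: int = RECORD_OVERHEAD) -> List[int]:
    """Ciphertext sizes an observer sees if each chunk is sent as one record."""
    return [len(c.encode("utf-8")) + overhead for c in chunks]


def batch_tokens(tokens: List[str], n: int) -> List[str]:
    """Token batching: merge every n consecutive tokens into one chunk."""
    return ["".join(tokens[i:i + n]) for i in range(0, len(tokens), n)]


def padded_record_sizes(chunks: List[str], block: int,
                        overhead: int = RECORD_OVERHEAD) -> List[int]:
    """Sizes seen if each chunk is padded up to a multiple of `block` bytes."""
    sizes = []
    for c in chunks:
        n = len(c.encode("utf-8"))
        padded = -(-n // block) * block  # ceiling to the next block boundary
        sizes.append(padded + overhead)
    return sizes


# Hypothetical streamed tokens for one response.
tokens = ["The", " diagnosis", " is", " likely", " benign", "."]

print(record_sizes(tokens))                    # [25, 32, 25, 29, 29, 23]
print(record_sizes(batch_tokens(tokens, 3)))   # [38, 37]
print(padded_record_sizes(tokens, 16))         # [38, 38, 38, 38, 38, 38]
```

Without a defense, the size sequence tracks token lengths one for one. Batching blurs individual tokens but still exposes group lengths and the record count, and padding flattens sizes at the cost of extra bytes while leaving the number and timing of records visible, which is consistent with the residual leakage described above.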

Why it matters

Any user querying a streaming LLM over a network that logs traffic leaks the topic of the query despite TLS encryption. That includes users who believe they are communicating privately with a medical, legal, or financial assistant.

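How the opportunity is scored

The opportunity score is my subjective assessment, not a measurement. It reflects how painful the problem is, how often it occurs, and how far current solutions fall short. The higher the score, the more I think it is worth building.

Severity: 8/10

How much pain it causes when it happens.

Frequency: 8/10

How often you actually run into it.

White space: 8/10

How lacking the tools are that could solve it today.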

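More problems worth solving

Why does every AI app forget me the moment I close the tab?

Why is learning a new field still limited by knowing what to ask?

Why can't non-experts verify what an AI tells them?

Why do we test models on benchmarks and ship them on vibes?

Why can't AI agents remember their own mistakes?

Why can't we audit what a model was actually trained on?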
