Use a non-streaming model for middleware-only calls so internal outputs do not leak into user streams and model-based middleware stays consistent.