Note that callers should not be releasing buffers that they allocated. So I think we need only be consistent within our implementation and within mechanisms that plug into our implementation.