Smaller self-indexes for natural language