ANALHITZA: a tool to extract linguistic information from large corpora in Humanities research