Neutralizing the effect of translation shifts on automatic machine translation evaluation