PreCOMET
The PreCOMET is a fork of Unbabel/COMET from version 2.2.4.
All original licensing applies.
It is used in the subset2evaluate package.
The divergent fork was created such that this package does not conflict with the original one.
Models
The models are available on Huggingface:
- PreCOMET-cons
- PreCOMET-diversity
- PreCOMET-avg
- PreCOMET-var
- PreCOMET-diffdisc_direct
- PreCOMET-diff
- PreCOMET-disc
The model usage is described on Huggingface.
Briefly, install the PreCOMET package:
pip install pip3 install git+https://github.com/zouharvi/PreCOMET.gitthen:
import precomet
model = precomet.load_from_checkpoint(precomet.download_model("zouharvi/PreCOMET-diversity"))
model.predict([
{"src": "This is an easy source sentence."},
{"src": "this is a much more complicated source sen-tence that will pro·bably lead to loww scores 🤪"}
])["scores"]
> [25.921934127807617, 20.805429458618164]For PreCOMET-diversity, segments with lower scores are better for evaluation because they lead to different system translations.
Other
This work is described in How to Select Datapoints for Efficient Human Evaluation of NLG Models?.
Cite as:
@misc{zouhar2025selectdatapointsefficienthuman,
title={How to Select Datapoints for Efficient Human Evaluation of NLG Models?},
author={Vilém Zouhar and Peng Cui and Mrinmaya Sachan},
year={2025},
eprint={2501.18251},
archivePrefix={arXiv},
primaryClass={cs.CL},
url={https://arxiv.org/abs/2501.18251},
}