GitHunt
ZO

zouharvi/PreCOMET

PreCOMET

The PreCOMET is a fork of Unbabel/COMET from version 2.2.4.
All original licensing applies.

It is used in the subset2evaluate package.
The divergent fork was created such that this package does not conflict with the original one.

Models

The models are available on Huggingface:

The model usage is described on Huggingface.
Briefly, install the PreCOMET package:

pip install pip3 install git+https://github.com/zouharvi/PreCOMET.git

then:

import precomet
model = precomet.load_from_checkpoint(precomet.download_model("zouharvi/PreCOMET-diversity"))
model.predict([
  {"src": "This is an easy source sentence."},
  {"src": "this is a much more complicated source sen-tence that will pro·bably lead to loww scores 🤪"}
])["scores"]
> [25.921934127807617, 20.805429458618164]

For PreCOMET-diversity, segments with lower scores are better for evaluation because they lead to different system translations.

Other

This work is described in How to Select Datapoints for Efficient Human Evaluation of NLG Models?.
Cite as:

@misc{zouhar2025selectdatapointsefficienthuman,
    title={How to Select Datapoints for Efficient Human Evaluation of NLG Models?}, 
    author={Vilém Zouhar and Peng Cui and Mrinmaya Sachan},
    year={2025},
    eprint={2501.18251},
    archivePrefix={arXiv},
    primaryClass={cs.CL},
    url={https://arxiv.org/abs/2501.18251}, 
}
Apache License 2.0
Created December 31, 2024
Updated May 30, 2025
zouharvi/PreCOMET | GitHunt