Fill out this form to submit existing evaluation results on the released test set.
The leaderboard reports the F-score variants of ROUGE-1, ROUGE-2, and ROUGE-L.
System scores should be computed over every article-summary pair in the test set, across all extractiveness subsets.
(For a more complete evaluation, consider submitting a Docker image of your system for unreleased test data evaluation in the future!)
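To make the F-score variant of ROUGE mentioned above concrete, here is a minimal Python sketch of ROUGE-N (N = 1 or 2). It is an illustration only, not the scorer used by the leaderboard: it assumes lowercased whitespace tokenization, no stemming, and a simple mean over article-summary pairs, and the function names are hypothetical. ROUGE-L, which is based on the longest common subsequence, is omitted. Scores for submission should come from an established ROUGE implementation.

```python
from collections import Counter


def ngrams(tokens, n):
    """Count the n-grams in a token list."""
    return Counter(tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1))


def rouge_n_f(reference, candidate, n=1):
    """ROUGE-N F-score for one reference/candidate pair.

    Assumption: lowercased whitespace tokenization, no stemming.
    """
    ref = ngrams(reference.lower().split(), n)
    cand = ngrams(candidate.lower().split(), n)
    # Clipped overlap: each n-gram counts at most as often as it
    # appears in both the reference and the candidate.
    overlap = sum((ref & cand).values())
    if overlap == 0:
        return 0.0
    precision = overlap / sum(cand.values())
    recall = overlap / sum(ref.values())
    return 2 * precision * recall / (precision + recall)


def corpus_rouge_n_f(pairs, n=1):
    """Mean ROUGE-N F-score over (reference, candidate) pairs.

    Assumption: the system score is a plain average over all pairs,
    with no filtering by extractiveness subset.
    """
    scores = [rouge_n_f(ref, cand, n) for ref, cand in pairs]
    return sum(scores) / len(scores) if scores else 0.0
```

Passing every pair in the test set to `corpus_rouge_n_f`, rather than a single extractiveness subset, matches the requirement that scores cover the whole test set.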
Fill out this form to submit a system for evaluation on the unreleased test set.
Systems must be submitted as Docker or GPU-enabled Docker containers (
Docker is used to minimize differences between your training environment and the test environment.
The NEWSROOM download tools include:
- A working Docker example for the TextRank summarization system.
- The CLI tool used to run submitted summarization systems (
- The CLI tool used to evaluate summaries of submitted systems (
These tools are provided to test systems prior to submission.