WIKIEVENTS is a benchmark dataset (English) built in 2021 for the document-level event extraction task.
It has complete events and relative annotations on 246 documents.
The event annotation task followed established ontology from the KAIROS project.
The number of events types in training, development, and testing sets are 49, 35, and 34, respectively.
And the number of annotated sentences are 5262, 378, and 492, respectively.
Compared to ACE, the WIKIEVENTS dataset has a much richer event ontology, especially for argument roles.