MaskedWiki dataset and attached models are described in the following publication: https://arxiv.org/abs/1905.06290 The code (together with instructions) to run the models and utilize the data can be found here: https://github.com/vid-koci/bert-commonsense The MaskedWiki_downsampled was used in the above paper. MaskedWiki_full was not used, however, we make it available for anybody who might find MaskedWiki_downsampled too small. The format of the dataset follows the format of the Definite Pronoun Resolution dataset (Rahman and Ng, 2012): Each example is given in 5 lines. The first line is the sentence, with one noun replaced with [MASK]. The second line is [MASK] (the word that has to be replaced). The third line contains both candidates, separated with a comma. Note that the order of the candidates is NOT guaranteed to be random. The fourth line contains the correct candidate. The fifth line is empty.