AD Research Wiki: Datasets

Freebase RDF Dump

The last dump of Freebase that was available:

/nfs/raid2/haussmae/freebase/latest

ClueWeb 12 Full

/nfs/raid1/clueweb12

Elmar extracted sentences containing entities from FACC1:

/nfs/raid1/haussmae/clueweb_extracted_sentences

Elmar also parsed the sentences with spaCy and identified binary and mediated Freebase relations between FACC1 entities:

/nfs/raid5/haussmae/keyword-translation-data/clueweb_sentences

Elmar also filtered the parsed sentences above to keep only questions:

/nfs/raid5/haussmae/qa-completion-data/questions

ClueWeb 09 B

/nfs/raid1/clueweb09_b

FACC1

Google's entity annotations on ClueWeb 09 and ClueWeb 12:

/nfs/raid2/haussmae/freebase_annotations

Other corpora

/nfs/raid2/haussmae/datasets
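
The locations listed on this page are NFS paths, so a script may want to confirm they are mounted before starting a long job. The following is a minimal sketch of such a check. The paths are copied from this page; the dictionary keys and the function name `check_datasets` are hypothetical labels chosen for illustration, and nothing is assumed about the files inside each directory.

```python
from pathlib import Path

# Paths copied from this page. The keys are illustrative labels only,
# not names used anywhere on the file servers.
DATASETS = {
    "freebase_rdf_dump": "/nfs/raid2/haussmae/freebase/latest",
    "clueweb12_full": "/nfs/raid1/clueweb12",
    "clueweb12_facc1_sentences": "/nfs/raid1/haussmae/clueweb_extracted_sentences",
    "clueweb12_parsed_sentences": "/nfs/raid5/haussmae/keyword-translation-data/clueweb_sentences",
    "clueweb12_questions": "/nfs/raid5/haussmae/qa-completion-data/questions",
    "clueweb09_b": "/nfs/raid1/clueweb09_b",
    "facc1_annotations": "/nfs/raid2/haussmae/freebase_annotations",
    "other_corpora": "/nfs/raid2/haussmae/datasets",
}


def check_datasets(datasets=DATASETS):
    """Return a dict mapping each label to True if its path exists."""
    return {name: Path(path).exists() for name, path in datasets.items()}


if __name__ == "__main__":
    # Print one line per dataset so that missing mounts are easy to spot.
    for name, available in check_datasets().items():
        status = "ok" if available else "MISSING"
        print(f"{status:8}{name}: {DATASETS[name]}")
```

Running the script on a machine without the NFS mounts reports every entry as MISSING, which is the expected result there.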

AD Research Wiki: Datasets (last edited 2016-07-29 17:14:08 by Hannah Bast)