-
Notifications
You must be signed in to change notification settings - Fork 356
Downloads
You can download a pre-built list of all the edges (assertions) in ConceptNet 5.7 in this gzipped, tab-separated text file.
As an example, here's the first line in the file (an Abkhaz word and its antonym):
/a/[/r/Antonym/,/c/ab/агыруа/n/,/c/ab/аҧсуа/] /r/Antonym /c/ab/агыруа/n /c/ab/аҧсуа {"dataset": "/d/wiktionary/en", "license": "cc:by-sa/4.0", "sources": [{"contributor": "/s/resource/wiktionary/en", "process": "/s/process/wikiparsec/1"}], "weight": 1.0}
The five fields of each line are:
- The URI of the whole edge
- The relation expressed by the edge
- The node at the start of the edge
- The node at the end of the edge
- A JSON structure of additional information about the edge, such as its weight
Would you like to try improving on the parser that created ConceptNet from the Open Mind Common Sense data? Here's a tab-separated table containing the original text.
- Text that someone probably typed as free text (or if they filled in a template, we didn't keep track of that fact): https://s3.amazonaws.com/conceptnet/downloads/2018/omcs-sentences-free.txt
- Text from various sources, including free text, templates, games, responses to questions, and so on: https://s3.amazonaws.com/conceptnet/downloads/2018/omcs-sentences-more.txt
Remember: this data comes from various forms of crowdsourcing. Sentences in these files are not necessarily true, useful, or appropriate.
ConceptNet also provides pre-computed downloadable term vectors (a replacement for word2vec or GloVe, for example), called ConceptNet Numberbatch. You can find links to ConceptNet Numberbatch downloads in its own repository.
Starting points
Reproducibility
Details