You can access the system without any limitations using the following endpoint:
http://www.serelex.org/MODEL/WORD. The list of available models is presented below:
You can download distributional thesauri (i.e. graphs of semantically related words) that are used as the basis of the current system in the CSV format: "word_i<TAB>word_j<TAB>similarity_ij", where "word_i" and "word_j" are words, such as ""Python" and "Ruby" and "similarity_ij" is their similarity score e.g. 0.789.
English PatternSim model extracted from the combination of ukWaC and Wikipedia corpora. You can also download raw PatternSim features derived from a larger corpus (a 59G combination of Wikipedia, ukWaC, GigaWord, and Leipzig news corpus).
If you would like to extract word similarity graphs from your own corpus you can use the PatternSim tool. Source code for evaluation of such graphs is also available (the sim-eval tool). Finally, you can run your own instance of the backend for improved performance if you use the API from your application.
PatternSim: a pattern-based semantic relatednes measure used to extract word graphs used in Serelex from text corpora.
Serelex lexical-semantic search engine (this web site).
sim-eval: tools for evaluation of semantic similarity/relatedness measures. More informaton about this tool.