I'm running a Lubuntu 16.04 machine with gcc installed. I can't get gensim to work with cython: when I train a doc2vec model, it only ever trains with one worker, which is dreadfully slow.
As I said, gcc was installed from the start. I may then have made the mistake of installing gensim before cython. I tried to correct that by forcing a reinstall of gensim via pip, but it had no effect: still just one worker.
The machine is set up as a spark master, and I interface with spark via pyspark. It works like this: pyspark uses jupyter, and jupyter uses python 3.5, so I get a jupyter interface to my cluster. I have no idea whether this is the reason I can't get gensim to work with cython. I don't execute any gensim code on the cluster; it is just more convenient to fire up jupyter for the gensim work too.
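
One way to narrow this down might be to check, from inside the same jupyter kernel, which python interpreter is actually running and whether gensim loaded its compiled routines. The sketch below assumes that the installed gensim exposes a FAST_VERSION flag on gensim.models.word2vec and that a negative value means the pure-Python fallback is in use; describe_fast_version and report are hypothetical helper names, not part of gensim.

```python
import sys


def describe_fast_version(value):
    """Interpret gensim's FAST_VERSION flag.

    Assumption: a negative value means the compiled extension was not
    loaded, and any value >= 0 means it was.
    """
    if value is None:
        return "FAST_VERSION unavailable (gensim missing or flag not exposed)"
    if value < 0:
        return "pure-Python fallback: the compiled extension was not loaded"
    return "compiled extension loaded (FAST_VERSION=%d)" % value


def report():
    # Show which interpreter the notebook kernel runs; pip may have
    # installed gensim and cython into a different one.
    print("interpreter:", sys.executable)
    try:
        from gensim.models import word2vec
        value = getattr(word2vec, "FAST_VERSION", None)
    except (ImportError, RuntimeError):
        # gensim is absent, or it refused to import without its
        # compiled extension.
        value = None
    print(describe_fast_version(value))
    return value


if __name__ == "__main__":
    report()
```

If the interpreter path printed here differs from the one pip installed into, the reinstall would not have affected the python that jupyter uses.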