-
Notifications
You must be signed in to change notification settings - Fork 141
Open
Description
Hello I am currently running the command below with traceback shown.
Do I need to start over and delete enwiki-latest-pages-articles.xml? I've had to start and stop this script once because I was not in the correct environment where I had tensorflow and could not find wikidata_linker_utils. But once I was in the correct environment it got past that issue.
My ultimate goal is to simply run this notebook (https://github.com/openai/deeptype/blob/master/learning/SentencePredictions.ipynb)
So I assume i need to download the data and train the model.
Any help would be appreciated. Thank you.
sh extraction/full_preprocess.sh ${DATA_DIR} en
Downloading wikidata into data/.
Will prepare language: en
Creating data directory
Done.
Downloading and preparing Wikidata:
Already compressed Wikidata
Done with wikidata.
Preparing language: en
Already downloaded and extracted enwiki-latest-pages-articles.xml.
Process Process-9:
Traceback (most recent call last):
File "/home/ec2-user/anaconda3/envs/tensorflow_p36/lib/python3.6/multiprocessing/process.py", line 258, in _bootstrap
self.run()
File "/home/ec2-user/anaconda3/envs/tensorflow_p36/lib/python3.6/multiprocessing/process.py", line 93, in run
self._target(*self._args, **self._kwargs)
File "/home/ec2-user/anaconda3/envs/tensorflow_p36/lib/python3.6/site-packages/epub_conversion/wiki_decoder.py", line 323, in convert_wiki_to_lines_inner_queue
for res in convert_wiki_to_lines_inner_generator(wiki, *args):
TypeError: 'NoneType' object is not iterable
Metadata
Metadata
Assignees
Labels
No labels