I run your code, while the following message doesn't appear.
[07 16:30:54 125@factory.py:/data/vision/billf/scratch/jiayuanm/projects/NSCL-PyTorch/nscl/datasets/factory.py] Filtering out questions containing "how big" and "made of", #before = 699989, #after = 633615.