-
Notifications
You must be signed in to change notification settings - Fork 173
Open
Description
Is there an existing issue for this?
- I have searched the existing issues
Issue Summary
Referenced Folder: Chatbot
While reviewing the codebase, I found some possible improvements that can make the chatbot model perform better. This proposal aims to enhance the chatbot's stability, generalization, and performance by standardizing the training and deployment processes.
-
NLP Standardization
The project uses very simple and naive tokenizer and stemmer. We can surely make it better by using the standard ones.- Use nltk.word_tokenize for superior tokenization.
- Use PorterStemmer for effective stemming.
-
Training Refinements (Efficiency & Monitoring)
- Logging: Calculate and print the Average Epoch Loss (instead of just the final batch loss).
- Performance: Set num_workers > 0 in the DataLoader for parallel data fetching.
If I find any more possible improvements, I will update the PR accordingly.
Record
- I have synced all my node versions as mentioned in the project
- I am using the same version of npm as is the project
- My current branch is in sync with the development branch
- I want to work on this issue
Metadata
Metadata
Assignees
Labels
No labels