Skip to content

[Enhancement]: Chatbot Improvement Refinements #198

@baranwalayush

Description

@baranwalayush

Is there an existing issue for this?

  • I have searched the existing issues

Issue Summary

Referenced Folder: Chatbot
While reviewing the codebase, I found some possible improvements that can make the chatbot model perform better. This proposal aims to enhance the chatbot's stability, generalization, and performance by standardizing the training and deployment processes.

  1. NLP Standardization
    The project uses very simple and naive tokenizer and stemmer. We can surely make it better by using the standard ones.

    • Use nltk.word_tokenize for superior tokenization.
    • Use PorterStemmer for effective stemming.
  2. Training Refinements (Efficiency & Monitoring)

    • Logging: Calculate and print the Average Epoch Loss (instead of just the final batch loss).
    • Performance: Set num_workers > 0 in the DataLoader for parallel data fetching.

If I find any more possible improvements, I will update the PR accordingly.

Record

  • I have synced all my node versions as mentioned in the project
  • I am using the same version of npm as is the project
  • My current branch is in sync with the development branch
  • I want to work on this issue

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions