ESCI Dataset: a large dataset of difficult search queries, released with the aim of fostering research in the area of semantic matching of queries and products. For each query, the dataset provides a list of up to 40 potentially relevant results, together with ESCI relevance judgements (Exact, Substitute, Complement, Irrelevant) indicating the relevance of each product to the query. This repository focuses on the English sample points of the dataset.
Note: For comparing models on the NDCG@10 criterion, the ESCI labels are mapped to the following gain values (see the sketch below):

- Exact: 1.0
- Substitute: 0.1
- Complement: 0.01
- Irrelevant: 0.0
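As a minimal sketch of how NDCG@10 is computed under this gain mapping (the function names and the per-query label/score inputs below are illustrative, not taken from this repository):

```python
import numpy as np

# Gain values for the ESCI labels, as listed above.
ESCI_GAINS = {"exact": 1.0, "substitute": 0.1, "complement": 0.01, "irrelevant": 0.0}

def dcg_at_k(gains, k=10):
    """Discounted cumulative gain over the top-k positions."""
    gains = np.asarray(gains[:k], dtype=float)
    discounts = np.log2(np.arange(2, gains.size + 2))  # log2(rank + 1)
    return float(np.sum(gains / discounts))

def ndcg_at_k(labels, scores, k=10):
    """NDCG@10 for one query: rank products by model score, compare to the ideal ranking."""
    gains = np.array([ESCI_GAINS[label] for label in labels], dtype=float)
    ranked = gains[np.argsort(scores)[::-1]]  # gains in model-ranked order
    ideal = np.sort(gains)[::-1]              # gains in ideal order
    ideal_dcg = dcg_at_k(ideal, k)
    return dcg_at_k(ranked, k) / ideal_dcg if ideal_dcg > 0 else 0.0

# Example: one query with four judged products
print(ndcg_at_k(["exact", "substitute", "irrelevant", "complement"],
                scores=[0.2, 0.9, 0.1, 0.4]))
```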
For compute reasons, a subset of the ESCI dataset was used: 50,000 sample points for training and 10,000 sample points for evaluation.
The purpose of this task is to explore multitask learning in the scope of e-commerce. Product recommendation and relevance calculation were the tasks chosen for evaluation. For each query, there are a number of product recommendations along with their labels.
Both bi-encoders and cross-encoders were explored for this task. Bi-encoders are fast and less computationally intensive, whereas cross-encoders are more accurate.
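A minimal sketch of the two scoring styles using the sentence-transformers library (the checkpoints and example texts below are generic placeholders, not the models trained in this project):

```python
from sentence_transformers import SentenceTransformer, CrossEncoder, util

query = "wireless noise cancelling headphones"
products = ["Sony WH-1000XM4 wireless headphones",
            "Wired earbuds with mic",
            "Headphone stand"]

# Bi-encoder: encode query and products independently, then compare embeddings
# (fast, product embeddings can be precomputed and cached).
bi_encoder = SentenceTransformer("all-MiniLM-L6-v2")
q_emb = bi_encoder.encode(query, convert_to_tensor=True)
p_emb = bi_encoder.encode(products, convert_to_tensor=True)
bi_scores = util.cos_sim(q_emb, p_emb)[0]

# Cross-encoder: score each (query, product) pair jointly
# (slower, but typically more accurate).
cross_encoder = CrossEncoder("cross-encoder/ms-marco-MiniLM-L-6-v2")
cross_scores = cross_encoder.predict([(query, p) for p in products])

print(bi_scores.tolist(), cross_scores.tolist())
```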
The loss function for this project was chosen to be a weighted average of the RCR loss and the BCE loss. The RCR loss is itself a weighted average of the MSE loss and the ListCE loss.
Loss = (1 - x) * RCR + x * BCE
RCR = alpha * MSE + (1 - alpha) * ListCE
Loss = (1 - x) * alpha * MSE + (1 - x) * (1 - alpha) * ListCE + x * BCE
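A PyTorch sketch of this combined loss. Only the weighting scheme above is specified in this section, so the exact form of the ListCE term (softmax over a query's products against normalized gains) and the binary targets used for the BCE term are assumptions made for illustration:

```python
import torch
import torch.nn.functional as F

def listce_loss(scores, gains):
    """List-wise cross-entropy: softmax over one query's products vs. normalized gains (assumed form)."""
    target = gains / gains.sum().clamp_min(1e-8)
    return -(target * F.log_softmax(scores, dim=-1)).sum()

def combined_loss(scores, gains, binary_labels, x=0.5, alpha=0.5):
    """Loss = (1 - x) * RCR + x * BCE, where RCR = alpha * MSE + (1 - alpha) * ListCE."""
    mse = F.mse_loss(torch.sigmoid(scores), gains)
    rcr = alpha * mse + (1 - alpha) * listce_loss(scores, gains)
    bce = F.binary_cross_entropy_with_logits(scores, binary_labels)
    return (1 - x) * rcr + x * bce

# Example: one query with four products. Gains follow the ESCI mapping above;
# binary_labels here treat only "Exact" as a positive, which is an assumption.
scores = torch.randn(4, requires_grad=True)
gains = torch.tensor([1.0, 0.1, 0.01, 0.0])
binary = torch.tensor([1.0, 0.0, 0.0, 0.0])
print(combined_loss(scores, gains, binary).item())
```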
The best approach (cross-encoder) on the initial dataset achieved an NDCG@10 of 0.9196.
The best approach (cross-encoder) on the refined dataset achieved an NDCG@10 of 0.9006.