This repository provides an experiment on generating recaps from the BOOKSUM (Krysćiński et al. 2022) dataset.
Please clone the repository with the command git clone --recurse-submodules git@github.com:jecGrimm/Recap.git.
If the repository was cloned via git clone git@github.com:jecGrimm/Recap.git, please run git submodule update --init --recursive to fetch the required submodules.
We provide a conda environment which can be installed with the command conda env create -f environment.yml.
To compute the SPICE metric, please download Stanford CoreNLP 3.6.0 and place the files stanford-corenlp-3.6.0.jar and stanford-corenlp-3.6.0-models.jar in the directory CaptionMetrics/pycocoevalcap/spice/lib/.
We compare an NER model, SBERT, and Gemma-2-2b-it for generating recaps from BOOKSUM (Krysćiński et al. 2022). A recap is a summary of previous content that is relevant for upcoming content.
For the NER model and SBERT, we map each last chapter summary to the second-to-last chapter summaries. For NER, we extract the sentences from the second-to-last chapter summary that contain at least one named entity that also appears in the last chapter summary. For SBERT, we extract the sentences whose cosine similarity with the last chapter summary exceeds 0.1. Gemma-2-2b-it is prompted to generate recaps for the book titles in BOOKSUM (Krysćiński et al. 2022).
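The NER-based extraction can be sketched as follows. This is a minimal illustration, not the repository's implementation: a crude capitalized-word heuristic stands in for the real NER model, and the sentence splitter is a simple regex.

```python
import re

def extract_recap(previous_summary: str, next_summary: str) -> str:
    """Keep sentences from the previous summary that mention an entity
    that also appears in the next summary. A capitalized-word regex is
    used here as a crude stand-in for a real NER model."""
    def entities(text):
        # crude heuristic: any capitalized word counts as an "entity"
        return set(re.findall(r"\b[A-Z][a-z]+\b", text))

    next_ents = entities(next_summary)
    kept = []
    for sent in re.split(r"(?<=[.!?])\s+", previous_summary):
        if entities(sent) & next_ents:
            kept.append(sent)
    return " ".join(kept)
```

Sentences with no entity overlap (such as pure scene description) are dropped, which is the intended behavior: only content tied to recurring entities is carried into the recap.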
We evaluate the generated recaps with BLEU-1, ROUGE-L, and SPICE. For further analysis, we create figures that show the positions and the sources of the kept sentences.
This directory contains a modified version of the repository wangleihitcs/CaptionMetrics. It is used to compute the ROUGE-L and SPICE scores.
This directory contains the dataset files with the original summaries of BOOKSUM (Krysćiński et al. 2022). Each instance maps one last chapter summary to its second-to-last chapter summaries. Keys:
recap_id: ID of the instance, in the format <book_id>_<next_source>
bid: ID of the book
previous_summary_id: list of the second-to-last chapter summary ids
previous_summary: list of the second-to-last chapter summaries
previous_source: list of the second-to-last chapter summary sources
next_summary_id: last chapter summary id
next_summary: last chapter summary
next_source: source of the last chapter summary
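An instance might look like the following. All IDs, sources, and texts here are invented for illustration and are not taken from the dataset.

```python
# Illustrative instance only; field values are invented, not from BOOKSUM.
instance = {
    "recap_id": "bid123_sparknotes",            # <book_id>_<next_source>
    "bid": "bid123",
    "previous_summary_id": ["bid123-ch9-s1", "bid123-ch9-s2"],
    "previous_summary": ["In chapter nine, ...", "Chapter nine sees ..."],
    "previous_source": ["gradesaver", "cliffnotes"],
    "next_summary_id": "bid123-ch10-s1",
    "next_summary": "In the final chapter, ...",
    "next_source": "sparknotes",
}
```

Note that the three previous_* fields are parallel lists: the i-th entries describe the same second-to-last chapter summary.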
This directory contains files with the evaluation metrics from the performed experiments.
This directory contains the notebook for the LLM recap generation. Gemma-2-2b-it needs a GPU to run.
This directory contains the recaps generated by the examined approaches.
This directory contains the figures for the analysis of the kept sentences.
This script provides functions for the creation of the figures for the analysis.
This script contains the class RecapData, which maps the chapter summaries from BOOKSUM (Krysćiński et al. 2022) and creates the baseline and gold recaps. The last chapter summaries serve as gold references, the second-to-last summaries as baselines.
This script develops the thresholds for the NER model and for SBERT on the validation split.
This script provides functions to evaluate the generated recaps. We compute BLEU-1, ROUGE-L, and SPICE scores.
This script generates recaps for all examined approaches on the test split and evaluates them.
This script generates recaps with Gemma-2-2b-it. Please note that a GPU is needed to run the script!
This script contains the class NER which can generate extractive recaps via NER matching.
This script contains the class SentenceSimilarity which can generate extractive recaps via cosine similarity.
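The similarity-based extraction follows the same pattern as the NER variant, but keeps sentences above a cosine-similarity threshold. A minimal sketch with bag-of-words vectors; the actual SentenceSimilarity class uses SBERT embeddings, but the thresholding logic is analogous:

```python
from collections import Counter
import math

def cosine_sim(a: str, b: str) -> float:
    """Bag-of-words cosine similarity (stand-in for SBERT embeddings)."""
    va, vb = Counter(a.lower().split()), Counter(b.lower().split())
    dot = sum(va[w] * vb[w] for w in va)
    norm = math.sqrt(sum(v * v for v in va.values())) * \
           math.sqrt(sum(v * v for v in vb.values()))
    return dot / norm if norm else 0.0

def keep_sentences(sentences, next_summary, threshold=0.1):
    # keep previous-summary sentences whose similarity to the
    # last chapter summary exceeds the threshold
    return [s for s in sentences if cosine_sim(s, next_summary) > threshold]
```

The 0.1 threshold mirrors the value used in the experiments; it was tuned on the validation split.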
Krysćiński, Wojciech, Nazneen Rajani, Divyansh Agarwal, Caiming Xiong, and Dragomir Radev. 2022. “BOOKSUM: A Collection of Datasets for Long-Form Narrative Summarization.” In Findings of the Association for Computational Linguistics: EMNLP 2022, edited by Yoav Goldberg, Zornitsa Kozareva, and Yue Zhang, 6536–58. Abu Dhabi, United Arab Emirates: Association for Computational Linguistics. https://doi.org/10.18653/v1/2022.findings-emnlp.488.