Running SLTev on ESIC (interpretation corpus)

Hi

We are trying to test text-to-text translation on the ESIC corpus. The problem is that ESIC is document aligned, but not sentence aligned. The documents are segmented, but the number of segments does not match between source and target, so SLTev throws an error. Yet it states in the documentation that  "segmentation can differ from the reference one"

How should this case be handled in SLTev?

```
evaluation for  en.OSt.man.orto.txt.slt  failed, the number of Complete lines (C) in  en.OSt.man.orto.txt.OStt  and  en.IStde.man.orto.txt  are not equal
The number of C segment (complete) in en.OSt.man.orto.txt.OStt is 2693 and number of lines in en.IStde.man.orto.txt is 2900
```

best
Barry




Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Running SLTev on ESIC (interpretation corpus) #67

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Running SLTev on ESIC (interpretation corpus) #67

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions