Skip to content

Commit 75c12dc

Browse files
committed
update guidelines
1 parent 6c5497a commit 75c12dc

3 files changed

Lines changed: 35 additions & 22 deletions

File tree

Lines changed: 5 additions & 12 deletions
Original file line numberDiff line numberDiff line change
@@ -1,18 +1,11 @@
11
<div class="row">
22
<div class="col-md-12">
3-
<p>Assess the translation quality on the 10-point scale using quality levels as follows:</p>
4-
5-
63
<ul class="list-unstyled">
7-
<li><strong>Not acceptable (1-2)</strong>: The translation is misleading or incomprehensible, with severe meaning errors and broken language that make it unusable.</li>
8-
<li><strong>Borderline (3-4)</strong>: The text conveys some meaning but contains frequent errors and awkward phrasing; it requires heavy editing to be useful.</li>
9-
<li><strong>Acceptable (5-6)</strong>: The message is clear and mostly accurate, though minor issues in grammar or style remain; suitable with light to moderate editing.</li>
10-
<li><strong>Good (7-8)</strong>: The translation is accurate and fluent with only rare awkwardness; tone and style are appropriate, needing minimal polishing.</li>
11-
<li><strong>Excellent (9-10)</strong>: The text reads like a native-level, publication-ready piece: fully accurate, idiomatic, stylistically polished, and coherent.</li>
4+
<li><strong>Very Good (9–10)</strong>: Complete meaning transfer. Sounds perfectly natural. (no or minimal proofreading)</li>
5+
<li><strong>Good (7–8)</strong>: Near‑complete transfer, minor inaccuracies. Mostly natural, minor awkwardness. (needs light proofreading)</li>
6+
<li><strong>Acceptable (56)</strong>: Main ideas conveyed but with noticeable inaccuracies or omissions. Uneven naturalness, awkward phrasing. (usable only after substantial revision)</li>
7+
<li><strong>Borderline (3–4)</strong>: Partial transfer; frequent misinterpretation or omission that confuses the message. Often unnatural. (requires major rewrite)</li>
8+
<li><strong>Not acceptable (1–2)</strong>: Violation of meaning; large portions mistranslated, missing, or incoherent. (unusable without complete retranslation)</li>
129
</ul>
13-
14-
{% if doclvl %}
15-
<p>Note that sentences in each paragraph can be separated by the <code>&lt;eos&gt;</code> or <code>&lt;br/&gt;</code> tags for convenience and this should not impact your assessment.</p>
16-
{% endif %}
1710
</div>
1811
</div>

EvalView/templates/EvalView/pairwise-assessment-document.html

Lines changed: 11 additions & 10 deletions
Original file line numberDiff line numberDiff line change
@@ -25,9 +25,10 @@
2525
.quotelike .row, .pseudoquotelike .row { font-size: 16px; margin: 0; padding: 10px 20px; }
2626
.quotelike.active { background-color: #f7f7f7; }
2727

28-
.question-box { margin-bottom:10px; }
29-
.question-box p { font-size: 120%; margin: 10px 0 20px; font-style: italic; color: #31708f; }
30-
.question-box li { font-size: 100%; font-style: italic; color: #31708f; }
28+
.question-box { margin:20px 0; }
29+
.question-box p { font-size: 120%; margin: 10px 10px 0; color: #555; }
30+
.question-box ul { margin: 10px; }
31+
.question-box li { font-size: 100%; color: #555; }
3132
.document-box { padding-top: 10px; }
3233
.document-box p { font-size: 16px; }
3334

@@ -864,13 +865,13 @@
864865
{% if trusted_user %}
865866
<span class="glyphicon glyphicon-ok-sign" aria-hidden="true"></span>
866867
{% endif %}
867-
{{completed_blocks}}/{{total_blocks}} documents,
868+
{{completed_blocks}}/{{total_blocks}} screens,
868869
<span id="items-left-counter">{{items_left_in_block}}</span>
869-
items left in document
870+
items left on this screen
870871
</span>
871872
</td>
872873
<td style="width:33%;text-align:center;">
873-
{{campaign}}:#{{datask_id}}:Document #{{document_id}}-<span id="current-item-id">{{item_id}}</span>
874+
{{campaign}}:{{datask_id}}:{{document_id}}-<span id="current-item-id">{{item_id}}</span>
874875
</td>
875876
<td style="width:33%;text-align:right;">
876877
{% if source_language %}{{source_language}} &rarr; {% endif %}{{target_language}}
@@ -879,11 +880,11 @@
879880
</table>
880881
</div>
881882

882-
<div class="question-box">
883+
<div class="question-box alert alert-success">
883884
<div class="row">
884885
<div class="col-sm-12">
885886
{% for text in priming_question_texts %}
886-
<p>{{text|safe}}</p>
887+
{{text|safe}}
887888
{% endfor %}
888889
{% if scalar_slider %}
889890
{% with mono=monolingual doclvl=doc_guidelines %}
@@ -1121,9 +1122,9 @@
11211122

11221123
{% else %}
11231124

1124-
<div class="question-box">
1125+
<div class="question-box alert alert-success">
11251126
<div class="row">
1126-
<div class="col-sm-12 question-box">
1127+
<div class="col-sm-12">
11271128
{% for text in document_question_texts %}
11281129
<p>{{text|safe}}</p>
11291130
{% endfor %}

EvalView/views.py

Lines changed: 19 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -2391,6 +2391,25 @@ def pairwise_assessment_document(request, code=None, campaign_name=None):
23912391
'in {0} (right column) convey the original semantics of the source document '
23922392
'in {1} (left column)? '.format(target_language, source_language),
23932393
]
2394+
2395+
# new guidelines
2396+
priming_question_texts = [
2397+
'<p>'
2398+
f'Below is a document in {source_language} presented sentence by sentence. Each source sentence has been translated by two different systems, A and B, into {target_language}. '
2399+
'Your task is to rate each translation using the scale below, based on three criteria: <br/>'
2400+
'</p>'
2401+
'<p>'
2402+
f'<strong>Naturalness</strong>: Does the translation sound fluent in {target_language}?<br/>'
2403+
f'<strong>Accuracy</strong>: Does the translation correctly preserve the meaning of the source text?<br/>'
2404+
f'<strong>Coherence</strong>: Does the sentence translation fit well in the document context?<br/>'
2405+
'</p>'
2406+
'<p><em>Note</em>: You first evaluate each sentence individually, but you should use the entire translation as context.</p>'
2407+
'<p>Rating scale:</p>'
2408+
]
2409+
document_question_texts = [
2410+
'For the final step, please look again at each translated document. '
2411+
'Provide one final, overall rating for each translation candidate, judging it as a whole. '
2412+
]
23942413

23952414
monolingual_task = 'monolingual' in campaign_opts
23962415
use_sqm = 'sqm' in campaign_opts

0 commit comments

Comments
 (0)