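The Chinese Learner English Corpus (CLEC) contains more than one million words of text from five types of students: middle school students, College English Band 4 and Band 6 students, and lower-grade and upper-grade English majors. The speech errors in the corpus are tagged. The aim is to observe the characteristics of each group's English and the errors they make, to describe Chinese learners' English fairly precisely by quantitative and qualitative methods, and to provide useful feedback for English teaching in China.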
Table 1. Distribution of the CLEC corpus

Type    Tokens
ST2     208088
ST3     209043
ST4     212855
ST5     214510
ST6     226106
Total   1070602
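The figures in Table 1 can be checked with a few lines of Python. This is only an illustrative sketch: the names `TOKENS`, `total` and `shares` are hypothetical, and the numbers are copied from the table.

```python
# Token counts per sub-corpus, copied from Table 1.
TOKENS = {
    "ST2": 208088,
    "ST3": 209043,
    "ST4": 212855,
    "ST5": 214510,
    "ST6": 226106,
}

# The sum should reproduce the "Total" row of the table.
total = sum(TOKENS.values())

# Share of each sub-corpus in percent, rounded to one decimal place.
shares = {name: round(100 * count / total, 1) for name, count in TOKENS.items()}

for name, share in shares.items():
    print(f"{name}: {share}%")
print("total:", total)
```

Each of the five sub-corpora contributes roughly a fifth of the corpus, so the groups are of comparable size.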
Error tagging principles
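1. Simple, reasonable, and easy to apply systematically. Many people take part in the tagging, and an overly elaborate classification scheme would be hard to master. We use a two-level classification. The first level has 11 categories: word form (fm), verb phrase (vp), noun phrase (np), pronoun (pr), adjective phrase (aj), adverb (ad), prepositional phrase (pp), conjunction (cj), word (wd), collocation (cc), and sentence (sn). Each category is subdivided further by number. For example, [cc] marks an improper collocation: [cc1] is a noun/noun collocation, [cc2] a noun/verb collocation, [cc3] a verb/noun collocation, and so on.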
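2. The categories should be of moderate granularity. A coarse scheme is easy to apply consistently but carries too little information for analyzing learner errors; a fine scheme is hard to apply consistently, and the same error can easily end up in different categories. Our current approach is to go fine on common errors (vp and np each have 9 subcategories) and coarse on rare ones (cj has only two). The present scheme has 61 error codes, which makes it a medium-sized classification.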
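3. Provide enough information about each error (the error itself, its type, and its scope). For example, in "In the past, people are [vp6, 4-] kind to each other…" the error is marked in square brackets placed after the error. [vp6] is the 6th type (tense) of vp (verb) error, and 4- is the scope of the error: the - marks the position of the error, and the 4 means that 4 words precede it. Only by taking these 4 words into account can one judge that the word "are" is wrong.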
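4. Openness. Researchers may add error types as needed or subdivide the existing ones further. For example, [sn8] marks a structurally deficient sentence, and a researcher can split this error into several finer classes for study. This requires retrieving all sn8 errors and then defining third-level categories such as sn81, sn82, and so on.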
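5. Register and the source of an error are not tagged for now, because that would require more subjective judgment from the taggers and would be even harder to keep consistent.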
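The tag format described in principles 1, 3 and 4 can be sketched in code. The following is a minimal illustration, not the tooling used by the CLEC project: the names `TAG_RE`, `ErrorTag`, `find_errors` and `select` are hypothetical, and the optional number after the hyphen (words after the error) is an assumption, since the text only describes the count before the hyphen.

```python
import re
from typing import List, NamedTuple

# Matches tags such as "[cc1]" or "[vp6, 4-]": a two-letter category, a
# subtype number, and an optional scope. Digits after the hyphen are an
# ASSUMED extension (words after the error), not described in the text.
TAG_RE = re.compile(r"\[([a-z]{2})(\d+)(?:,\s*(\d*)-(\d*))?\s*\]")


class ErrorTag(NamedTuple):
    category: str       # first-level category, e.g. "vp"
    subtype: str        # digits after the category, e.g. "6" or "81"
    before: int         # number of context words before the error
    after: int          # number of context words after the error (assumed)
    error_word: str     # the word immediately preceding the tag
    context: List[str]  # the `before` words that precede the error word


def find_errors(text: str) -> List[ErrorTag]:
    """Extract every error tag in `text` together with its left context."""
    errors = []
    for m in TAG_RE.finditer(text):
        # Words before the tag, with any earlier tags stripped out.
        words = TAG_RE.sub(" ", text[:m.start()]).split()
        before = int(m.group(3)) if m.group(3) else 0
        after = int(m.group(4)) if m.group(4) else 0
        error_word = words[-1] if words else ""
        context = words[-1 - before:-1] if words else []
        errors.append(
            ErrorTag(m.group(1), m.group(2), before, after, error_word, context)
        )
    return errors


def select(errors: List[ErrorTag], code: str) -> List[ErrorTag]:
    """Return all errors with exactly this code, e.g. "sn8" (principle 4)."""
    return [e for e in errors if e.category + e.subtype == code]


sample = "In the past, people are [vp6, 4-] kind to each other"
for err in select(find_errors(sample), "vp6"):
    print(err.error_word, err.context)
```

Retrieving every instance of a code in this way is the first step a researcher would take before assigning third-level labels such as sn81 or sn82 by hand.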
Error classification table (total: 61)

Category              Code  Type
Word form             fm1   spelling
                      fm2   word building
                      fm3   capitalization
Verb phrase           vp1   pattern
                      vp2   set phrase
                      vp3   agreement
                      vp4   finite/non-finite
                      vp5   non-finite
                      vp6   tense
                      vp7   voice
                      vp8   mood
                      vp9   modal/auxiliary
Noun phrase           np1   pattern
                      np2   set phrase
                      np3   agreement
                      np4   case
                      np5   countability
                      np6   number
                      np7   article
                      np8   quantifiers
                      np9   other determiners
Pronoun               pr1   reference
                      pr2   anticipatory it
                      pr3   agreement
                      pr4   case
                      pr5   wh-
                      pr6   indefinite
Adjective phrase      aj1   pattern
                      aj2   set phrase
                      aj3   degree
                      aj4   -ed/-ing confusion
                      aj5   predicative/attributive
Adverb                ad1   order
                      ad2   modification
                      ad3   degree
Prepositional phrase  pp1   pattern
                      pp2   set phrase
Conjunction           cj1   pattern
                      cj2   set phrase
Word                  wd1   order
                      wd2   part of speech
                      wd3   substitution
                      wd4   absence
                      wd5   redundancy
                      wd6   repetition
                      wd7   ambiguity
Collocation           cc1   noun/noun
                      cc2   noun/verb
                      cc3   verb/noun
                      cc4   adj/noun
                      cc5   verb/adv
                      cc6   adv/adj
Sentence              sn1   run-on sentence
                      sn2   sentence fragment
                      sn3   dangling modifier
                      sn4   illogical comparison
                      sn5   topic prominence
                      sn6   coordination
                      sn7   subordination
                      sn8   structural deficiency
                      sn9   punctuation
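The stated total of 61 codes can be cross-checked against the classification table. The sketch below is illustrative only: `SUBTYPES`, `ALL_CODES` and `is_valid_code` are hypothetical names, and the per-category counts are read off the table above.

```python
# Number of second-level subtypes per first-level category,
# read off the error classification table.
SUBTYPES = {
    "fm": 3, "vp": 9, "np": 9, "pr": 6, "aj": 5, "ad": 3,
    "pp": 2, "cj": 2, "wd": 7, "cc": 6, "sn": 9,
}

# Enumerate every two-level code: fm1..fm3, vp1..vp9, and so on.
ALL_CODES = [
    f"{category}{n}"
    for category, count in SUBTYPES.items()
    for n in range(1, count + 1)
]


def is_valid_code(code: str) -> bool:
    """True if `code` is one of the two-level codes in the table."""
    return code in ALL_CODES


print(len(SUBTYPES), "categories,", len(ALL_CODES), "codes")
```

A check like `is_valid_code` could be used to catch mistyped tags during annotation; third-level codes added by researchers (such as sn81) would need to be registered separately.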
Tagging notes

Code  Category  Type (Chinese label)  Description
fm1   word      spelling (拼写)  spelling, coinage, abbreviation, apostrophe
fm2   word      word building (构词)  derivation, inflection, compounding, plurality (noun), irregularity (verb), 3rd person singular form (verb), syllabification, hyphenation, word division or fusion
fm3   word      capitalization (大小写)  lower initial letter for upper initial letter or vice versa
vp1   vb phr    pattern (及物性型式)  error in transitivity (vi as vt or vice versa), transitive verb pattern/grammatical (cf. Oxford Advanced Learner's Dictionary of Current English, edited by A. S. Hornby)
vp2   vb phr    set phrase (固定词组)  phrasal verb and verbal phrase: error in form or use
vp3   vb phr    agreement (主谓一致性)  number agreement with its subject (noun or pronoun)
vp4   vb phr    finite/non-finite (定式)  finite verb for non-finite verb or vice versa
vp5   vb phr    non-finite (不定式)  infinitive error: form and use/ infinitive for participle or vice versa/ -ed participle for -ing participle or vice versa
vp6   vb phr    tense (时态)  error in tense use within a sentence/ the sequence of tenses between sentences
vp7   vb phr    voice (语态)  error in the use of voice: active for passive or vice versa
vp8   vb phr    mood (语气)  error in the use of mood: imperative, subjunctive/ improper structure of conditional sentences
vp9   vb phr    modal/auxiliary (情态)  misuse of modal/auxiliary verbs/ wrong form of modal verb (or auxiliary verb) and verb combination (e.g. tense form, voice form, etc.)
np1   nn phr    pattern (名词型式)  error in combination with other words/grammatical
np2   nn phr    set phrase (固定词组)  omission or replacement of a fixed element that goes after a certain noun
np3   nn phr    agreement (主谓一致性)  number agreement of a noun with its determiner or a word that refers to it
np4   nn phr    case (格)  possessive case error: form or use
np5   nn phr    countability (可数性)  uncountable noun used as countable noun
np6   nn phr    number (数)  countable noun used with no determiner or -s / a or -s with plural noun
np7   nn phr    article (冠词)  a/an confusion or definite/indefinite confusion
np8   nn phr    quantifiers (数量词)  misuse or confusion between many/much, (a) few/(a) little, some/any, etc.
np9   nn phr    other determiners (其他限定词)  misuse or confusion of demonstratives, wh-determiners, numerals, etc.
pr1   pron      reference (指称)  incorrect/ambiguous pronoun reference/anaphoric
pr2   pron      anticipatory it (先行 it)  improper or wrong use of anticipatory it / it replaced by a demonstrative, etc.
pr3   pron      agreement (主谓一致性)  number agreement with a noun it refers to
pr4   pron      case (格)  case error of any personal pronoun
pr5   pron      wh- (wh- 代词)  misuse or confusion of interrogative, relative and conjunctive pronouns
pr6   pron      indefinite (不定式)  misuse or confusion of indefinite pronouns such as all/both, few/little, some/any, either/neither, etc.
aj1   adj       pattern (形容词型式)  error in the combination with other words/grammatical
aj2   adj       set phrase (固定词组)  error in the idiomatic use of an adjectival phrase/ omission or replacement of a fixed element that goes after a certain adjective
aj3   adj       degree (级)  adjective degree error: form and use
aj4   adj       -ed/-ing confusion (-ed/-ing 混淆)  -ed adjective for -ing adjective or vice versa
aj5   adj       predicative/attributive (谓语/定语)  predicative adjective used as attributive adjective
ad1   adv       order (词序)  improper adverb placement/wrong position
ad2   adv       modification (修饰语)  adjective modifier used as verb modifier/ other kinds of confusion
ad3   adv       degree (级)  adverb degree error: form and use
pp1   prep      pattern (介词型式)  unacceptable combination with other words/grammatical
pp2   prep      set phrase (固定词组)  error in the formation or use of an idiomatic prepositional phrase
cj1   conj      pattern (连词型式)  unacceptable combination with other words/grammatical
cj2   conj      set phrase (固定词组)  error in the formation or use of a phrase functioning as a conjunction
wd1   word      order (词序)  misplacement of any word other than an adverb
wd2   word      part of speech (词类)  error in part of speech: right root but wrong word class
wd3   word      substitution (替代)  error in word choice: right word class but wrong selection (any part of speech)
wd4   word      absence (缺少)  omission of a word (any part of speech)
wd5   word      redundancy (冗余)  oversuppliance of a word (any part of speech)
wd6   word      repetition (重复)  unnecessary repeating of a word
wd7   word      ambiguity (歧义)  not clear word meaning/semantic
cc1   notional  n/n collocation (名词/名词)  improper noun (phrase) and noun (phrase) combination/semantic
cc2   notional  n/v collocation (名词/动词)  improper noun (phrase) and verb (phrase) combination/semantic
cc3   notional  v/n collocation (动词/名词)  improper verb and noun (phrase) combination/semantic
cc4   notional  a/n collocation (形容词/名词)  improper adjective and noun (phrase) combination/semantic
cc5   notional  v/ad collocation (动词/副词)  improper verb and adverb (or ad/v) combination/semantic
cc6   notional  ad/a  improper adverb and adjective