Details

Type: Improvement
Resolution: Fixed
Priority: Major
Component/s: Platform
Labels: None

Description

The collection reader automatically tries to guess the language a text is written in.

However, this can produce wrong results when the text is too short. The typical symptom is that a document known to be in a given language is not analyzed by any annotator, because it was recognized as being written in another language. When we see this behavior, we tend to think there is a bug in the framework.
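To illustrate the failure mode, here is a minimal sketch of one possible mitigation. It is not necessarily the fix applied for this issue. It assumes the language can be declared explicitly and that guessing can be skipped for short texts. The names `resolve_language`, `guess_language`, `MIN_CHARS_FOR_GUESSING` and the threshold of 50 characters are all hypothetical and do not come from the framework.

```python
from typing import Callable, Optional

# Hypothetical threshold: below this many characters, guessing is
# treated as unreliable. The value 50 is an assumption, not a measured one.
MIN_CHARS_FOR_GUESSING = 50


def resolve_language(
    text: str,
    guess_language: Callable[[str], str],
    declared_language: Optional[str] = None,
    min_chars: int = MIN_CHARS_FOR_GUESSING,
) -> str:
    """Return the language tag to attach to a document.

    A declared language always wins. Otherwise the guesser is used only
    when the text is long enough; short texts get the tag "unknown".
    """
    if declared_language:
        return declared_language
    if len(text.strip()) < min_chars:
        return "unknown"
    return guess_language(text)


# Stand-in guesser that always answers "fr", to mimic a misdetection.
def always_french(text: str) -> str:
    return "fr"


print(resolve_language("Hello world", always_french, declared_language="en"))
print(resolve_language("Hello world", always_french))
```

With a declared language the short text keeps its known language. Without one it is tagged "unknown" instead of being silently assigned the wrong language, which makes the skipped annotators easier to explain.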

People

Assignee: Eduard Moraru

Reporter: Fabio Mancinelli

Votes: 0

Watchers: 0

Dates

Due: 11/Oct/10

Created: 01/Oct/10 13:55

Updated: 11/Oct/10 20:37

Resolved: 11/Oct/10 20:37

Date of First Response: 11/Oct/10 8:37 PM