• vi-VNen-GB



On July 28, 2010, in Hanoi, the Institute of Information Technology - Hanoi National University and VIEGRID Communication & Technology JSC in collaboration with VietNamNet held a press conference to announce the report on The Current State of The Vietnamese in Text.

In this current stage of development, Vietnam's society is facing with many new challenges. Language is a precious cultural property which associates with the historical development of each nation. Preserving the purity of Vietnamese and enriching the Vietnamese are always responsibility of each Vietnamese.

It is expected that Vietnamese community, especially young generation, together preserves and protects Vietnamese Language. Therefore, The Institute of Information Technology - Hanoi National University, in coordination with VIEGRID Communication & Technology JSC conducted scientific surveys and researches and evaluation on the spelling in Vietnamese text with a slogan being "Join us in preserving the Vietnamese language".

General Director of VIEGRID - Mrs. Le Ngoc Hong said: "We always want to contribute to the community with practical and meaning work. For years, we have developed software pieces to process Vietnamese. Thus, together with the Institute of IT and linguists, we launch this report to raise the first bell on the popularity of misspelling and to call for a strong war against this social issue."

E-newspaper VietnamNet will be one of the first press agencies to lead this community campaign, to recognize the issue seriously, and to join with society to preserve Vietnamese.

Before evaluating the quality of spelling in Vietnamese text, a small survey was conducted on two groups of linguists and IT Professionals: The group of Language Professionals requested that the spelling error rate in Vietnamese text should be less than 1%. The group of IT professionals accepted this rate of about 2.5 - 5%. Both groups agreed that the sector of press and media should be most responsible for the state of Vietnamese spelling. The majority of professionals agreed that the rate of 10% was alarming threshold for spelling error, and 30% was the threshold that a misspelling has become accepted as a new correct spelling.

In the ranking in June 2010, 177 units were evaluated on spelling errors and 132 units in seven sectors were ranked (on the web page xephangvanban.com)

1) Ministry and Central Office;

2) People's Committees of the provinces and cities directly under the Central Government;

3) Government and Ministry agencies;

4) Universities and Research Institutes;

5) Press, publishers and media agencies;

6) Vietnamese enterprises;

7) Foreign Organizations and Agencies in Vietnam.

A statistic was made on 67.000 samples. The statistic method, basing on the typical error file, is suitable with the condition of limited resources. The statistics showed that the average rate of misspelling of Vietnamese text was 7.79 %, higher than the minimum requirement.

The result showed that the words with highest error rate were "soi mói" with 74, 33 %, "Sáng lạn" with 41, 66 %, "cọ sát" with 28, 38 %, "thăm quan" with 20, 61 %.

The sector of press and communication had the highest spelling error rate, nearly at alarming rate of 10 %.

The error rate at the sector of Universities and Research Institutes was approximately to the average rate of society, not matching with its role as a model and pioneer in correct word using.

In particular, both sectors had their representative with error rate over 30%.

The sector of local governments, and agencies under the Ministry also had relatively high spelling error rate. In particular, there were units with error rate nearly 40 %.

Even that better sectors of enterprises and Ministries should further improve themselve in order to achieve the standard rate of 1%.

The detailed evaluation results are published on the Website www.xephangvanban.com

The results above reflect an alarming state of Vietnamese spelling. The group of authors, through this work, expects to make the whole society and ranked units understand the importance of Vietnamese spelling issue. The evaluation will be conducted once for every 3 months and will continue to be expanded in scale to support a public campaign on scanning misspelling.

The introduction of Vietnamese spelling checking software pieces on the website www.xephangvanban.com is necessary for the campaign. Basing on the statistical analysis, the group of authors estimated that the rate between non-word error and the substantive error was 31.69%: 68.31 %. Contrary to conception of some IT professionals, and different from English, in Vietnamese, substantive error is the major one. That may explain why the software pieces which can not scan substantive errors do not get a strong response from users.

For objective evaluation, the authors used criteria such as recognition decree, accuracy decree and the ability to give suggestion to evaluate performance of error checking software piecies. The group of experts also used the measurements VIE - a measurement which considers above factors and the ratio between non-word errors and substantive errors.

In the future, businesses, professionals and users can introduce new products to community to improve Vietnamese on the Web site www. xephangvanban.com

Any solo attempts, despite its great effort, will not bring the results as desired. Through this work, the group of authors expects to have the participation of managers, linguists and cultural activists in this campaign, and hopes that there are more software products to server community. JOIN US IN PRESERVING THE VIETNAMESE LANGUAGE.

Thanks and best regards!