Welcome
This is the online resources site of SiSTeC and its related technologies.
What is SiSTeC?
SiSTeC stands for SiStem Terjemahan berasaskan SSTC (SSTC-based Translation System) [1]. It consists of a set of natural language processing (NLP) tools, based on the flexible Structured String-Tree Correspondence (SSTC) annotation schema [2,3] and related tools.
Following are some example NLP tools and applications developed with the SiSTeC framework:
- SiSTeC-ebmt
- An example-based machine translation (EBMT) engine. Current language pairs include English-Malay (bidirectional) and English-Chinese. Other language pairs are currently under development, in collaboration with various universities.
- SiSTeC-Editor
- A GUI frontend for examining and editing translation examples annotated as Synchronous SSTC (S-SSTC), which form the basis of SiSTeC-ebmt's bilingual knowledge bank.
- SiSTeC-CATT
- A lightweight translation workbench, in which human translators can post-edit translation outputs from SiSTeC-ebmt, and for looking up translation memories and terminiology lists.
Contact
- Tang Enya Kong, Ph.D. (moc.liamg|1gnokayne#moc.liamg|1gnokayne)
- for further enquiries about SiSTeC
- Lim Lian Tze, Ph.D. (moc.liamg|eztnail#moc.liamg|eztnail)
- for information about this webpage.
Bibliography
1. Boitet, C., Zaharin, Y. & Tang, E. K. (2011). Learning-to-translate based on the S-SSTC annotation schema. In Proceedings of PACLIC 2011. Singapore.
2. Boitet, C. & Zaharin, Y. (1988). Representation Trees and String-Tree Correspondences. In Proceedings of the 12th International Conference on Computational Linguistics, vol. 1, pp. 59—64.
3. Al-Adhaileh, M., Tang, E. K. & Zaharin, Y. (2002). A synchronization structure of SSTC and its applications in machine translation. In COLING 2002 Post-Conference Workshop “Workshop on Machine Translation in Asia”. Taipei, Taiwan.
page revision: 18, last edited: 28 May 2015 14:02