{"id":33657,"date":"2025-10-17T12:00:41","date_gmt":"2025-10-17T04:00:41","guid":{"rendered":"https:\/\/fass.nus.edu.sg\/srn\/?p=33657"},"modified":"2025-11-27T17:37:42","modified_gmt":"2025-11-27T09:37:42","slug":"tagging-singapore-english","status":"publish","type":"post","link":"https:\/\/fass.nus.edu.sg\/srn\/2025\/10\/17\/tagging-singapore-english\/","title":{"rendered":"Tagging Singapore English"},"content":{"rendered":"<p><span data-contrast=\"none\">Since its inception in 1988, the International Corpus of English (ICE) has been a cornerstone for research on World Englishes, comprising corpora from 14 countries, ranging from Inner Circle countries like Britain and the US to Outer Circle countries like Singapore and the Philippines. Grammatically annotating the ICE corpora is a tall order due to limited resources and the need for human oversight. Part-of-speech (PoS) taggers are tools that annotate each word in a text with its grammatical category. For example, \u2018table\u2019 is labelled with the tag \u2018noun\u2019. Modern PoS taggers are trained on data from Inner Circle English and can be used as cost-effective tools to tag Outer Circle English, though with lower accuracy. Even so, their relatively high accuracy makes it easier to check and correct the automatic tagging, easing what would otherwise be an extremely labour-intensive process. These corrected texts can then be used as a benchmark to improve PoS taggers, in turn making them more effective for processing Outer Circle English materials.<\/span><span data-ccp-props=\"{}\">\u00a0<\/span><\/p>\n<p><span data-contrast=\"none\">In \u2018Tagging Singapore English\u2019 (<\/span><i><span data-contrast=\"none\">World Englishes<\/span><\/i><span data-contrast=\"none\">, 2022), Bao et al. 
(NUS English, Linguistics and Theatre Studies) explored using the Stanford PoS tagger, trained on standard American English, to tag the Singaporean component of the ICE (ICE-SIN). Tagging ICE-SIN is part of a larger effort to build a tagged and parsed treebank of Singapore English. The researchers found that the Stanford PoS tagger attained 96% accuracy in the more formal registers of ICE-SIN, comparable to the accuracy rates reported for British and American English. As expected, the accuracy was lower in the informal register of private conversations in ICE-SIN. The researchers partly attributed this reduced accuracy to contact-induced changes that are characteristic of Singapore English, including lexical and grammatical borrowings.<\/span><span data-ccp-props=\"{}\">\u00a0<\/span><\/p>\n<p><span data-contrast=\"none\">Lexical borrowings that were limited to Singapore and Malaysia, such as \u2018kiasu\u2019 and \u2018kopitiam\u2019, posed less of a challenge as they could be treated as regular words. However, grammatical borrowings, such as the sentence-final particles and novel uses of words like \u2018got\u2019, posed a greater problem by introducing an extra layer of structure or grammatical meaning not found in English morphosyntax. Properly tagging these forms required contextual morphosyntactic information to resolve the categorical uncertainty of words.<\/span><span data-ccp-props=\"{}\">\u00a0<\/span><\/p>\n<p><span data-contrast=\"none\">Tagging ICE-SIN not only provides important insights into the contact-induced changes in Singapore English, but also demonstrates the feasibility and benefits of linguistically annotating other Outer Circle varieties within the ICE project. A tagged ICE-SIN allows for data-driven investigation of language contact in unprecedented detail. 
More broadly, systematically annotating the various ICE corpora would further establish the corpus as an invaluable resource for quantitative research on language variation.\u202f<\/span><span data-ccp-props=\"{}\">\u00a0<\/span><\/p>\n<p><span data-contrast=\"none\">Read the full article <\/span><a href=\"https:\/\/doi.org\/10.1111\/weng.12597\"><span data-contrast=\"none\">here<\/span><\/a><span data-contrast=\"none\">.<\/span><span data-ccp-props=\"{}\">\u00a0<\/span><\/p>\n<figure id=\"attachment_33658\" aria-describedby=\"caption-attachment-33658\" style=\"width: 2560px\" class=\"wp-caption aligncenter\"><img loading=\"lazy\" decoding=\"async\" class=\"size-full wp-image-33658\" src=\"https:\/\/fass.nus.edu.sg\/srn\/wp-content\/uploads\/sites\/15\/2024\/07\/Poster-using-Singlish-e1722406690567.jpg\" alt=\"\" width=\"2560\" height=\"1440\" srcset=\"https:\/\/fass.nus.edu.sg\/srn\/wp-content\/uploads\/sites\/15\/2024\/07\/Poster-using-Singlish-e1722406690567.jpg 2560w, https:\/\/fass.nus.edu.sg\/srn\/wp-content\/uploads\/sites\/15\/2024\/07\/Poster-using-Singlish-e1722406690567-300x169.jpg 300w, https:\/\/fass.nus.edu.sg\/srn\/wp-content\/uploads\/sites\/15\/2024\/07\/Poster-using-Singlish-e1722406690567-1024x576.jpg 1024w, https:\/\/fass.nus.edu.sg\/srn\/wp-content\/uploads\/sites\/15\/2024\/07\/Poster-using-Singlish-e1722406690567-768x432.jpg 768w, https:\/\/fass.nus.edu.sg\/srn\/wp-content\/uploads\/sites\/15\/2024\/07\/Poster-using-Singlish-e1722406690567-1536x864.jpg 1536w, https:\/\/fass.nus.edu.sg\/srn\/wp-content\/uploads\/sites\/15\/2024\/07\/Poster-using-Singlish-e1722406690567-2048x1152.jpg 2048w\" sizes=\"auto, (max-width: 2560px) 100vw, 2560px\" \/><figcaption id=\"caption-attachment-33658\" class=\"wp-caption-text\">Photo: \u2018Poster using Singlish\u2019 by Kelman Chiang, from SRN\u2019s SG Photobank<\/figcaption><\/figure>\n<p><span data-ccp-props=\"{}\">\u00a0<\/span><\/p>\n","protected":false},"excerpt":{"rendered":"<p>Since its inception in 
1988, the International Corpus of English (ICE) has been a cornerstone for research on World Englishes, comprising corpora from 14 countries, ranging from Inner Circle countries like Britain and the US to Outer Circle countries like Singapore and the Philippines. Grammatically annotating the ICE corpora is a tall order due [&hellip;]<\/p>\n","protected":false},"author":311,"featured_media":33658,"comment_status":"closed","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"_acf_changed":false,"site-sidebar-layout":"default","site-content-layout":"","ast-site-content-layout":"default","site-content-style":"default","site-sidebar-style":"default","ast-global-header-display":"","ast-banner-title-visibility":"","ast-main-header-display":"","ast-hfb-above-header-display":"","ast-hfb-below-header-display":"","ast-hfb-mobile-header-display":"","site-post-title":"","ast-breadcrumbs-content":"","ast-featured-img":"","footer-sml-layout":"","theme-transparent-header-meta":"default","adv-header-id-meta":"","stick-header-meta":"","header-above-stick-meta":"","header-main-stick-meta":"","header-below-stick-meta":"","astra-migrate-meta-layouts":"set","ast-page-background-enabled":"default","ast-page-background-meta":{"desktop":{"background-color":"","background-image":"","background-repeat":"repeat","background-position":"center center","background-size":"auto","background-attachment":"scroll","background-type":"","background-media":"","overlay-type":"","overlay-color":"","overlay-opacity":"","overlay-gradient":""},"tablet":{"background-color":"","background-image":"","background-repeat":"repeat","background-position":"center center","background-size":"auto","background-attachment":"scroll","background-type":"","background-media":"","overlay-type":"","overlay-color":"","overlay-opacity":"","overlay-gradient":""},"mobile":{"background-color":"","background-image":"","background-repeat":"repeat","background-position":"center 
center","background-size":"auto","background-attachment":"scroll","background-type":"","background-media":"","overlay-type":"","overlay-color":"","overlay-opacity":"","overlay-gradient":""}},"ast-content-background-meta":{"desktop":{"background-color":"var(--ast-global-color-5)","background-image":"","background-repeat":"repeat","background-position":"center center","background-size":"auto","background-attachment":"scroll","background-type":"","background-media":"","overlay-type":"","overlay-color":"","overlay-opacity":"","overlay-gradient":""},"tablet":{"background-color":"var(--ast-global-color-5)","background-image":"","background-repeat":"repeat","background-position":"center center","background-size":"auto","background-attachment":"scroll","background-type":"","background-media":"","overlay-type":"","overlay-color":"","overlay-opacity":"","overlay-gradient":""},"mobile":{"background-color":"var(--ast-global-color-5)","background-image":"","background-repeat":"repeat","background-position":"center 
center","background-size":"auto","background-attachment":"scroll","background-type":"","background-media":"","overlay-type":"","overlay-color":"","overlay-opacity":"","overlay-gradient":""}},"footnotes":""},"categories":[4547,4529,4606,4609,4604],"tags":[],"class_list":["post-33657","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-english-language-and-literature","category-news","category-research","category-singapore-research-nexus","category-visible"],"acf":[],"_links":{"self":[{"href":"https:\/\/fass.nus.edu.sg\/srn\/wp-json\/wp\/v2\/posts\/33657","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/fass.nus.edu.sg\/srn\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/fass.nus.edu.sg\/srn\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/fass.nus.edu.sg\/srn\/wp-json\/wp\/v2\/users\/311"}],"replies":[{"embeddable":true,"href":"https:\/\/fass.nus.edu.sg\/srn\/wp-json\/wp\/v2\/comments?post=33657"}],"version-history":[{"count":3,"href":"https:\/\/fass.nus.edu.sg\/srn\/wp-json\/wp\/v2\/posts\/33657\/revisions"}],"predecessor-version":[{"id":35482,"href":"https:\/\/fass.nus.edu.sg\/srn\/wp-json\/wp\/v2\/posts\/33657\/revisions\/35482"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/fass.nus.edu.sg\/srn\/wp-json\/wp\/v2\/media\/33658"}],"wp:attachment":[{"href":"https:\/\/fass.nus.edu.sg\/srn\/wp-json\/wp\/v2\/media?parent=33657"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/fass.nus.edu.sg\/srn\/wp-json\/wp\/v2\/categories?post=33657"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/fass.nus.edu.sg\/srn\/wp-json\/wp\/v2\/tags?post=33657"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}