Wednesday, June 26, 2019
A Corpus-Based Analysis of Mixed Code in Hong Kong Speech
2012 International Conference on Asian Language Processing

John Lee
Halliday Centre for Intelligent Applications of Language Studies
Department of Chinese, Translation and Linguistics
City University of Hong Kong
[email protected]

Abstract. We present a corpus-based analysis of the use of mixed code in Hong Kong speech. From transcriptions of Cantonese television programs, we identify English words embedded within Cantonese utterances, and investigate the motivations for such code-switching. Among the many motivations observed in previous research, we found that four alone account for more than 95% of the use of English words in our speech data across genres, genders, and age groups. We performed analyses over more than 60 hours of transcribed speech, resulting in one of the largest empirical studies to date on this linguistic phenomenon.

Keywords: code-mixing; English; corpus linguistics; code-switching; Cantonese

I. INTRODUCTION

While Cantonese is the mother tongue for the majority of the people in Hong Kong, English is also spoken by 43% of the population [1], reflecting the city's heritage as a British colony. A well-known feature of the speech in Hong Kong is code-switching, i.e., "the juxtaposition of passages of speech belonging to two different grammatical systems or sub-systems, within the same exchange" [2].

Specifically, in the case of Hong Kong, the two grammatical systems are Cantonese and English. The former serves as the "matrix language", and the latter as the "embedded language", resulting in Cantonese sentences with English segments such as (example taken from [3]):

    heoi3 canteen jam2 caa4
    'go to the canteen for lunch'

Here, the English segment contains only one word ("canteen"), but in general, it can be a whole clause. We will use the general term "code-switching" rather than the more specific term "code-mixing", which refers to switching below the clause level, even though most English segments in our corpus do contain only one or two words (see Table 3).

There is already a large body of literature devoted to the study of Cantonese-English code-switching from the theoretical linguistic point of view [3,4,5]. This paper investigates the motivations behind the use of mixed code, on the basis of a large dataset of speech transcribed from television programs. In Section II, we outline previous research on the motivations of code-switching, and discuss how our investigation complements it. In Section III, we describe our methodology for corpus construction, in particular the design of the taxonomy of code-switching motivations. In Section IV, we present an analysis of these motivations according to genre, gender and age.

II. PREVIOUS RESEARCH

The first major framework for classifying code-switching motivations in Hong Kong consists of two categories: expedient and orientational [6]. Central to this framework is the distinction between words in "high Cantonese" and "low Cantonese".
In casual conversations, a speaker sometimes cannot find any word from "low Cantonese" to designate an object, feeling or idea (e.g., "application form"). Using a word from "high Cantonese" (e.g., biu2 gaak3), however, would sound too formal and thus stylistically inappropriate. In expedient mixing, the speaker resorts to an English word; the mixing is pragmatically motivated. In contrast, orientational mixing is socially motivated. The speaker chooses to use English (e.g., "barbecue") despite the availability of equivalent words from both low Cantonese (e.g., siu2 je5 sik6) and high Cantonese (e.g., siu1 haau1), since he perceives the subject matter to be inherently more "western".

This dichotomy has been criticized as too simplistic, because of the ambiguity in defining lexical and stylistic equivalents among low Cantonese, high Cantonese, and English. Instead, a four-way taxonomy is proposed: euphemism, specificity, bilingual punning, and the principle of economy [7]. This taxonomy is then further expanded, in a study of code-switching in text media [8], to include quotation, doubling, identity marking, and interjection. These categories will be explained in detail in Section III.

While these classification systems are comprehensive and well grounded, they do not per se give any sense of the relative importance or distribution of the various motivations. Our goal is, first, to empirically verify the coverage of these classification systems on a large dataset of transcribed speech and, second, to arrive at quantitative answers to questions such as: Which kinds of motivations are the most prominent?
Does the distribution of motivations differ according to the speech genre, or to the speaker's gender or age? We now turn our attention to the methodology for constructing and annotating a speech corpus for these research purposes.

978-0-7695-4886-9/12 $26.00 © 2012 IEEE. DOI 10.1109/IALP.2012.10

III. DATA

A. Source material

Our corpus is constructed from television programs broadcast in Hong Kong within the last four years by Television Broadcasts Limited (TVB). The programs belong to a variety of genres, including two drama series, three current-affairs shows, a news program, and a talk show.

The news program, TVB News at Six-Thirty, carries the most formal register, containing mainly pre-planned speech by the anchor. The current-affairs shows, Tuesday Report, Sunday Report and Hong Kong Connection, are serious in tone but contain spontaneous discussions. The talk show, My Sweets, is about food and drink. It also contains spontaneous discussions, but the topics tend to be lighter. Although pre-planned, the speech in both drama series, Moonlight Resonance and Yes Sir, Sorry Sir, is arguably the least formal in register, designed to reflect natural speech in everyday life. Details of these TV programs are presented in Table 1.

Table 1. TV programs that serve as the source material of our corpus.

    Genre            Program                                               Length
    Current affairs  Tuesday Report, Sunday Report, Hong Kong Connection   135 episodes x 20 minutes
    Talk show        My Sweets                                             24 episodes x 30 minutes
    News             TVB News at Six-Thirty                                5 episodes x 20 minutes
    Drama            Moonlight Resonance; Yes Sir, Sorry Sir               4 episodes x 45 minutes

B. Data processing

From the television programs listed in Table 1, all code-mixed utterances were transcribed, preserving the original languages, either Cantonese or English. Following standard practice, loanwords are not considered to be mixed code; in our context, all English words (e.g., "taxi") that have been adapted into Cantonese phonology (e.g., dik1 si2) were excluded.

The TV captions corresponding to each of these utterances are also recorded as part of the corpus. These captions are in standard Chinese, rather than Cantonese. Furthermore, alignments between the Chinese word(s) in the caption and the English word(s) in the utterance are annotated. This information will be used in the classification of motivations. Finally, two kinds of metadata about the speaker are recorded: gender (male or female) and age group (teenager or adult).

C. Taxonomy of Code-Switching Motivations

Our goal is to quantitatively determine the motivations behind code-switching. To this end, each English segment in the Cantonese sentences in our corpus is to be tagged with a motivation. Due to time constraints, this classification was performed only on the current-affairs and talk shows.

The expedient vs. orientational classification system is too coarse-grained for our purpose. Instead, we adopted the taxonomy in [7,8] as our starting point, and then introduced some new categories to accommodate our data. The categories in [7] are¹:

    ¹ A fourth category, "bilingual punning", is excluded from our taxonomy. As may be expected, punning is rarer in speech, and is hence not found in our corpus.

Euphemism. When a Cantonese word explicitly mentions something that the speaker finds embarrassing, s/he might opt for an English word that contains no such mention. For example, to avoid the female body part hung1 'breast' in the word hung1 wai4 'bra', the speaker might prefer to use the English "bra" (all examples are taken from [7]):

    tau3 bra gaak3 gaak3
    'A princess whose bra is see-through'

Specificity. Sometimes an English expression is preferred because its meaning is more general or specific compared with its near-synonymous counterparts [7] in either low or high Cantonese. For example, the verb "to book" means to make a reservation for which no money or deposit is needed, which is more specific than its closest equivalent in Cantonese, deng6 'to make a reservation'. It is commonly used in sentences such as:

    ngo5 soeng2 book saam1 dim2
    'I want to book 3 o'clock'

Principle of economy. An English expression may also be preferred because it is shorter and thus requires less linguistic effort compared with its Chinese/Cantonese equivalent [7]. While the word "check in" has two syllables, its Cantonese equivalent baan6 lei5 dang1 gei1 sau2 zuk6 'check in on a plane' has six. The principle of economy is therefore likely the reason behind mixed code such as:

    nei5 check in zo2 mei6 aa3
    'Have you checked in already?'

The taxonomy in [8] builds on the one in [7], further enriching it with categories² such as:

    ² Among these categories is "identity marking", for mixed code that marks social characteristics such as social status, education status, occupation, as well as regional affiliation [8]. We found it difficult to objectively identify this motivation, and excluded it from our taxonomy.

Quotation. When citing text or someone else's speech, one often prefers to use the original code to avoid having to perform translation. An example is direct speech:

    jau5 go3 pang4 jau5 man6 ngo5 what do you think
    'A friend asked me, "What do you think?"'

Doubling. Originally termed "emphasis" or "avoidance of repetition" [8], it will be referred to as "doubling" [9] here to make it explicit, as this category refers to English words that are embedded alongside Cantonese words that have the same or nearly the same meaning. The purpose is to emphasize the idea or to avoid repetition. In the following sentence, it serves as an emphasis:

    very good m4 co3 aa1
    'Very good, very good'

Interjection. English interjections may be inserted into the Cantonese sentence. For example:

    anyway nei5 hou2 sai1 lei6 ak1
    'Anyway, you are awesome'

A significant amount of mixed code in our corpus, however, still does not fit any of the above categories. Most of it falls under one of two reasons, personal name and register. We therefore added them to our taxonomy:

Register. This is somewhat similar to the "expedient" category in [6], but will be referred to as "register" in this paper to make the motivation explicit. Sometimes, the speaker cannot find any equivalent "low Cantonese" word, but feels awkward using a more formal "high Cantonese" word (e.g., paai1 deoi3 'party'). As a result, s/he resorts to an English equivalent instead.
For example:

    ngo5 dei6 go3 party hoi1 ci2 laa1
    'Our party is starting'

Personal name. It is common practice among Hong Kong people to adopt an English name. Although this phenomenon may be considered orientational code-mixing in terms of the "western perception" [6], it is given its own category, because it is very specific and accounts for a substantial amount of our data. A typical example is:

    Teresa, ngo5 dei6 zing2 dak1 leng3 m4 leng3
    'Teresa, did we make it nicely?'

D. Annotation procedure

We thus have a total of eight categories in our taxonomy of code-switching motivations. Five of these categories (namely euphemism, quotation, doubling, interjection, and personal name) can usually be unambiguously identified. The annotator, however, has often found it difficult to distinguish among specificity, register, and principle of economy. To maintain consistency, we adopted the following procedure.

When an English segment does not fit into any of the five "easy" categories, the annotator is to decide whether it has the same meaning as the Chinese word in the caption to which it is aligned. If it is deemed not to have the same meaning, then it is assigned "specificity". If it is equivalent in meaning, and the annotator cannot think of any equivalent in low Cantonese, then it is assigned "register". Lastly, if there is a low Cantonese equivalent, but its number of syllables is larger than that of the English segment, then the motivation is "principle of economy".

IV. ANALYSIS

This section presents some preliminary analyses on this corpus. We first consider the frequency and length of English segments in Cantonese speech (Section A), then discuss the distribution of the categories of motivations, both overall and with respect to genres, genders, and age groups (Section B).

A. Density and length of English segments

It is well known that English words are sprinkled quite liberally in the Cantonese speech in Hong Kong. We measure how the relative frequency of English segments varies across different genres. As shown in Table 2, the frequency correlates with the register of the genre (see Section III.A). In the drama series, the most informal genre, about one and a half English words are uttered per minute on average. The talk show occupies second place, and the current-affairs shows have slightly less frequent English words. In the news program, where the speech is pre-planned, the anchor did not utter any English word.

Table 2. The total number of Cantonese sentences containing English segments, and the total number of English words transcribed. The last row shows how often an English word is uttered.

    Program genre            Drama   Talk show   Current affairs   News
    Sentences with English   219     487         1495              0
    English words            259     625         1995              0
    Frequency (words/min)    1.4     0.87        0.74              0

Second, we measure the length of the English segments. Table 3 shows that the vast majority of English segments contain no more than two words. Across all genres, more than 80% of the English segments consist of only one English word. This figure is comparable to the 81.4% for text data reported in [8].

Table 3. Proportion of English segments with only one word (e.g., "canteen") or two words (e.g., "thank you").

    Program genre   Drama   Current affairs   Talk show
    One-word        85%     85%               81%
    Two-word        11%     11%               17%

B. Motivations for the use of mixed code

A plethora of motivations have been posited for the use of mixed code in Hong Kong (see Section II). Applying our proposed classification system (see Section III.C) on our corpus of transcribed speech, we are now able to discern the relative prevalence of the various kinds of code-switching motivations.
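The classification applied in this analysis rests on the annotation procedure of Section III.D. As a rough illustration only, that decision procedure can be sketched in Python as below. The record type `AnnotatorJudgment`, its field names, and the helper `count_jyutping_syllables` are hypothetical names introduced for this sketch; they are not part of the authors' tooling, and the human judgments themselves (same meaning or not, existence of a low Cantonese equivalent) are simply taken as inputs. The paper does not say what happens when a low Cantonese equivalent exists but is not longer than the English segment, so the sketch returns `None` in that case.

```python
from dataclasses import dataclass
from typing import Optional

# The five categories the paper says can usually be identified unambiguously.
EASY_CATEGORIES = frozenset(
    {"euphemism", "quotation", "doubling", "interjection", "personal name"}
)


@dataclass
class AnnotatorJudgment:
    """Hypothetical record of the annotator's answers for one English segment."""
    easy_category: Optional[str]             # a member of EASY_CATEGORIES, or None
    same_meaning_as_caption: bool            # same meaning as the aligned caption word?
    low_cantonese_equivalent: Optional[str]  # Jyutping string, or None if none comes to mind
    english_syllables: int                   # syllable count of the English segment


def count_jyutping_syllables(jyutping: str) -> int:
    """Count syllables, assuming one space-separated Jyutping token per syllable."""
    return len(jyutping.split())


def classify_motivation(j: AnnotatorJudgment) -> Optional[str]:
    """Follow the order of decisions described in Section III.D."""
    # Step 1: the five "easy" categories are assigned directly.
    if j.easy_category is not None:
        if j.easy_category not in EASY_CATEGORIES:
            raise ValueError(f"unknown easy category: {j.easy_category}")
        return j.easy_category
    # Step 2: a different meaning from the aligned caption word means specificity.
    if not j.same_meaning_as_caption:
        return "specificity"
    # Step 3: same meaning, but no low Cantonese equivalent means register.
    if j.low_cantonese_equivalent is None:
        return "register"
    # Step 4: a longer low Cantonese equivalent means principle of economy.
    if count_jyutping_syllables(j.low_cantonese_equivalent) > j.english_syllables:
        return "principle of economy"
    # Assumption of this sketch: the paper leaves this case unspecified.
    return None


# Illustrative input based on the paper's "check in" example; treating the
# six-syllable phrase as the annotator's chosen equivalent is an assumption.
example = AnnotatorJudgment(None, True, "baan6 lei5 dang1 gei1 sau2 zuk6", 2)
print(classify_motivation(example))  # prints: principle of economy
```

Keeping the judgments as explicit fields makes the order of the checks visible: specificity is tested before register, and register before principle of economy, so a segment can receive only one label.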
Table 4 shows the distribution of these motivations in the current-affairs and the talk shows. Four dominant motivations (namely register, personal name, principle of economy, and specificity) are attributed to more than 95% of the English segments. This trend is the same across genres (current-affairs and talk shows), genders (see Table 6), and age groups (see Table 5). All other categories, including quotation, euphemism, doubling, and interjection, are relatively infrequent.

Genres. Among the four dominant motivations, "register" (the use of appropriately informal words) is the most frequent motivation in both the current-affairs and talk shows. Its proportion, however, is significantly more pronounced in the talk show (47.4%) than in current affairs (36.4%), reflecting the more informal nature of the former.

Table 4. Distribution of code-switching motivations, contrasted between genres.

    Motivation             Current affairs   Talk show
    Register               36.4%             47.4%
    Personal name          26.8%             24.5%
    Principle of economy   19.0%             17.6%
    Specificity            13.2%             8.2%
    Quotation              2.1%              1.0%
    Doubling               1.4%              0.4%
    Interjection           0.9%              1.0%
    Euphemism              0.3%              0%

Age groups. Table 5 contrasts the distributions of code-switching motivations between adults and teenagers in the current-affairs shows³. As mentioned above, the four major motivations remain constant. However, teenagers are much more likely than adults to use English words to achieve a more informal register (52.4% vs. 35.1%). They also tend more to opt for English to save effort (23.8% vs. 18.6%). Somewhat surprisingly at first glance, teenagers address others by English names less often than adults (2.4% vs. 28.8%); it turns out that in the conversations in our corpus, teenagers often prefer to address adults with the more formal Chinese names, likely out of respect.
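As a quick arithmetic check of the statement that four motivations cover more than 95% of the English segments, the Table 4 percentages can be summed per genre. The sketch below is only an illustration: the names `TABLE_4`, `GENRES`, `DOMINANT` and `dominant_share` are made up for this example, and the figures are copied from Table 4 as reported.

```python
# Percentages copied from Table 4, as (current affairs, talk show).
TABLE_4 = {
    "register": (36.4, 47.4),
    "personal name": (26.8, 24.5),
    "principle of economy": (19.0, 17.6),
    "specificity": (13.2, 8.2),
    "quotation": (2.1, 1.0),
    "doubling": (1.4, 0.4),
    "interjection": (0.9, 1.0),
    "euphemism": (0.3, 0.0),
}
GENRES = ("current affairs", "talk show")
DOMINANT = ("register", "personal name", "principle of economy", "specificity")


def dominant_share(table, genre_index):
    """Sum the four dominant motivations for one genre, rounded to one decimal."""
    return round(sum(table[m][genre_index] for m in DOMINANT), 1)


for i, genre in enumerate(GENRES):
    print(f"{genre}: {dominant_share(TABLE_4, i)}%")
# current affairs: 95.4%
# talk show: 97.7%
```

Both sums exceed 95%, which agrees with the claim for the two genres covered by Table 4.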
Table 5. Distribution of code-switching motivations, contrasted between age groups.

    Motivation             Adults   Teenagers
    Register               35.1%    52.4%
    Personal name          28.8%    2.4%
    Principle of economy   18.6%    23.8%
    Specificity            13.1%    14.3%
    Quotation              1.9%     4.0%
    Doubling               1.3%     2.4%
    Interjection           0.9%     0%
    Euphemism              0.3%     0.8%

    ³ The speakers in the talk show are predominantly adults.

Genders. Finally, we investigate whether code-switching motivations are biased according to gender. Aggregating statistics from both the current-affairs and talk shows, Table 6 compares the motivations of males and those of females. Females are shown to be more likely to use English names to address others (32.9% vs. 18.9%); males, on the other hand, more frequently use English words to save effort (22.9% vs. 14.8%).

Table 6. Distribution of code-switching motivations, contrasted between genders.

    Motivation             Female   Male
    Register               37.5%    40.7%
    Personal name          32.9%    18.9%
    Principle of economy   14.8%    22.9%
    Specificity            10.9%    13.2%
    Quotation              1.9%     1.7%
    Doubling               1.1%     1.3%
    Interjection           0.7%     1.1%
    Euphemism              0.3%     0.2%

V. CONCLUSIONS

We have described the construction of a corpus of Cantonese-English mixed code, based on speech transcribed from television programs in Hong Kong. Drawn from more than 60 hours of speech, this corpus is among the largest of its kind. A novel feature of the corpus is the annotation of the motivation behind each code-mixed utterance.

Having proposed a classification system for these motivations, we applied it on our corpus, and reported differences in the use of mixed code between genres, genders and age groups. A key finding is that four main motivations ("register", "personal name", "principle of economy", and "specificity") account for more than 95% of the embedded English segments.

ACKNOWLEDGMENT

This project was partially funded by a Small Research Grant from the Department of Chinese, Translation and Linguistics at City University of Hong Kong. We thank Man Chong Mak and Hiu Yan Wong for compiling the corpus and performing annotation.

REFERENCES

[1] K. H. Y. Chen, "The Social Distinctiveness of Two Code-mixing Styles in Hong Kong," in Proceedings of the 4th International Symposium on Bilingualism, MA: Cascadilla Press, 2005, pp. 527-541.
[2] J. Gumperz, "The sociolinguistic significance of conversational code-switching," in RELC Journal 8(2), 1977, pp. 1-34.
[3] J. Gibbons, "Code-mixing and koineizing in the speech of students at the University of Hong Kong," in Anthropological Linguistics 21(3), 1979, pp. 113-123.
[4] B. H.-S. Chan, "How does Cantonese-English code-mixing work?", in Language in Hong Kong at Century's End, M. C. Pennington (ed.), 1998, pp. 191-216, Hong Kong: Hong Kong University Press.
[5] D. C. S. Li, "Linguistic convergence: impact of English on Hong Kong Cantonese," in Asian Englishes 2(1), 1999, pp. 5-36.
[6] K. K. Luke, "Why two languages might be better than one: motivations of language mixing in Hong Kong," in Language in Hong Kong at Century's End, M. C. Pennington (ed.), 1998, pp. 145-159, Hong Kong: Hong Kong University Press.
[7] D. C. S. Li, "Cantonese-English code-switching research in Hong Kong: a Y2K review," in World Englishes 19(3), 2000, pp. 305-322.
[8] H. Cao, "Development of a Cantonese-English code-mixing speech recognition system," PhD dissertation, Chinese University of Hong Kong, 2011.
[9] R. Appel and P. Muysken, Language Contact and Bilingualism. London: Arnold, 1987.