연구과제 상세정보

https://www.krm.or.kr/krmts/link.html?dbGubun=SD&m201_id=10006964&res=y

영어 파열음 유,무성성의 음향과 인지 비대칭성 연구

연구자가 한국연구재단 연구지원시스템에 직접 입력한 정보입니다.

사업명	신진연구장려금지원 [지원년도 신청 요강 보기 ]
연구과제번호	2004-908-A00059
선정년도	2004 년
연구기간	1 년 (2004년 11월 01일 ~ 2005년 10월 31일)
연구책임자	강석한
연구수행기관	연세대학교
과제진행현황	종료

과제신청시 연구개요

연구목표
본 논문은, 음변화의 한 과정으로서 '파찰음화'를 연구하였다. 이 파찰음화현상은 기존의 조음적인 측면에서는 'lenition'현상으로 보려는 경향이 주류를 이루었으며, 근래에는 'perceptual'측면에서 음변화를 'trigger'한다는 시각이 새롭게 대두되고 있다. 본 논문은 ...

본 논문은, 음변화의 한 과정으로서 '파찰음화'를 연구하였다. 이 파찰음화현상은 기존의 조음적인 측면에서는 'lenition'현상으로 보려는 경향이 주류를 이루었으며, 근래에는 'perceptual'측면에서 음변화를 'trigger'한다는 시각이 새롭게 대두되고 있다. 본 논문은, '음향학'적인 측면에서 Old High German 언어의 'spirantization'에 접근하여, 왜 파열음이 어떤 음절위치에서는(특히, onset) 파찰음으로 변하고, 또, 어떤위치 (inter-vocalic)에서는 마찰음, 혹은 flap, 그리고 모음으로 변하는지를 음향학적인 단서를 가지고 연구하였다.
본 연구에서는 이러한 음변화의 방향과 정도가 다른것은 음향학적인 'energy level'이 다르기 때문으로 보았고, 이를 'Fletcher-Munson scale'를 가지고 연구하였다. 이 energy level의 측정은 kirchner(2003)에서 이루어졌지만, 그의 연구방식은 'logarithm'적인 'energy level'를 'linear'로 처리했다는 점에서, 본 연구는 이를 수정 보완하였다.
본 연구의 목표는 다음과 같다.
1) 마찰음화 현상을 정확히 이해하기 위해서는, 조음적인 측면뿐만 아니라, 음향적인 측면에서 접근을 해야한다.
2) 마찰음화 현상은 마찰음뿐만 아니라, 파찰음, 파열음, flap, 모음과의 관계도 규명해야 정확히 이해될수 있다.
3) 마찰음화 현상을 음절내에서 이해할려고한 기존의 연구들(Iverson, Salmons, Holsinger, Kim)은 재고되어야 하며, cue를 근간으로 하는 feature의 특성이 음변화를 이끈다는 가설을 입증할려고 한다.
4) 이러한 음운론적인 목표를 위하여, Fletcher-Munson Scale 에너지 측정방식이 도입되고, pitchworks, Prrat, 의 음성 software와, 통계처리를 위하여 spss가 사용될 것이다.
5) 한국어에서 나타나는 음절두음에서의 경음화 현상이 나타나는 non-conditional change에 대하여는 설명이 미흡했는데, 본 연구를 이용하면 쉽게 설명할수 있다는 장점이 있다.

기대효과
본 논문은 마찰음화 현상뿐만 아니라, 마찰음과 페쇄음, 마찰음과 파찰음, 마찰음과 그밖의 다른 자음계통과의 변이및 변천관계를 음향적 기준에 따라 설명할려고 한다. 따라서, 본 논문의 완성되면, 기존의 조음적인 측면에서 마찰음화를 이해할려는 시각이 상당히 교정 ...

본 논문은 마찰음화 현상뿐만 아니라, 마찰음과 페쇄음, 마찰음과 파찰음, 마찰음과 그밖의 다른 자음계통과의 변이및 변천관계를 음향적 기준에 따라 설명할려고 한다. 따라서, 본 논문의 완성되면, 기존의 조음적인 측면에서 마찰음화를 이해할려는 시각이 상당히 교정이 될것이며, 이에따라 다른 음운현상 - 동화, 중화, 구개음화, -등도 음향적인 측면에서 이해될수 있는 단초를 제공해 줄것이다. 그리고, 무엇보다 한국어에서 나타나는 '경음화'현상도 쉽게 이해될수 있는 근거를 제시해 준다.
이러한 분석위한, Fletcher-Munson 기법은 언어학에 처음 도입되는 '음향학'적인 측정기법으로 이를 이용하여 조음적인 'energy level'이 객관적으로 측정이 될것이다. 사실, energy level의 측정은 Kirchner(2003)에 의하여 처음 이루어졌지만, 이는 'logarithm'적인 요소를 갖는 amplitude를 'linear' 로 해석함으로서, 측정방식에 심각한 오류를 범했다. 따라서, 본 저자는 이를 재편하여, 정밀한 측정이 이루어질수 있도록 모델을 새로 만들었다.
이런 모델은 음변화에서의 (특히 non-conditional change) 변화의 방향과 정도를 예측하는데, 이용할수 있고, 이는 특정언어에서만 나타나는 현상이 아니라, 모든 언어에서의 공통현상임을 규명할것이다.
본 연구가 계획대로 진행이 된다면, 외국인 한국어 습득자를 위한 음성 개발 software도 만들어볼수 있다. 즉, amplitude및 energy level를 정해준다음, 한국어 습득자들에게 연습을 시킨다면, 영어원어민 한국어 습득자들에게 가장 어려움을 겪는 '경음'을 쉽게 배울수 있다는 장점이 있다.
그리고, 본 연구의 최대의 기대효과로 볼수있는것은, 음변화를 수치화 함으로서 각음간의 변화를 일목요연하게 파악할수 있고, 이는 언어병리분야에서 어떤 음의 발음에 어려움을 겪는 환자들에게 치료법을 제시해줄수 있다.

연구요약
Spirantization mostly refers to the first consonant shift in Grimm's law, occurs as the side of synchronic, not just as of diachronic. Synchronic spirantization has been ignored in the phonetics or phonology. This paper will study the current spiran ...

Spirantization mostly refers to the first consonant shift in Grimm's law, occurs as the side of synchronic, not just as of diachronic. Synchronic spirantization has been ignored in the phonetics or phonology. This paper will study the current spirantization; Liverpool English, Florentine Italian, Taiwanese, and Uradhi.
The paper will answer the following questions:
1) does the spiratnization universal phenomenon?
2) what is the relation between spirantization and other classes of phonological patterns, such as assimilation?
3) what factors lead the spirantization in both fields of articulatory and perception?
4) why does spirantization overwhelmingly produce weak fricatives or approximants, rather than strident fricatives, which otherwise appear to be the unmarked fricatives?
5) in Korean which exist only one fricative [s], what sound can be obtained from spiratization?
6) what's the realtionship between spiratnization and prosody?
For answering these questions, I will adopt phonological knowledge as well as phonetic experiment. The hypothesis of the paper is to reveal that not only the sound in stressed syllable position change easily rather than one in unstressed syllable, but also the stops in the position changes to afficates because of the energy load to the sound rather than ones in unstressed position to flaps. For proving the hypothesis, both optimality theory and energy experiment using Pitchworks, Prrat, and SPSS statistics.
I will argue that phonological spirantization is driven by conflict between minimizing articulatory effort and maximizing perceptual distinctiveness, resolved through Optimality Theoretic constraint ranking. In the process, I will use the phonetic experiment to support my argument. I will draw the conclusion that whether the spirantisation changes to affricate, fricative, flaps depends on the phonetic cues like [energy level](production), or [onset boost](perception). This conclusion will connect to the notion of Steriades' "Licensing by Cue", in which features are licensed only by their cues. It means that the degree and direction of the sound change depend on the characteristics of features supported by various cues. For example, onset non-coronal stops in Old High German easily change to affricates because the degree of feature [spread glottis] is maximum(or active) in the onset position, in which cues like [energy level] and [onset burst] are most active.
I will extend Steriade's concept of Licensing by Cue to the study of sound change. I will draw the conclusion that sound change may be licensed by cues, not by prosodic position. My study will deny many scholars' conclusion (Iverson 1996, 2001, 2002. Salmons 1996, 2000, Holsinger 2001, Kim 2002) that the High German Consonantal Shift primarily is based on prosodic influence. Furthermore, I will extend to Liverpool English (Wells, 1982. Honeybone, 2001), Floretine Italian (Giannelli & Savoia, 1979), Taiwanise (Hsu, 1995), and Uradhi (Dixon, 1979) that the sound change by "licensing by cue" is universal.
For the study, I will experiment the energy scale. I measure energy scale depending on syllable position, sound place, and manner. Unlike Kirchner Energy measurement(= intensity (db) * times (ms)) measurement, because it cause the mistake that the logarithm scale multiplies the linear scale (duration), I will set up the new approach to the energy measurement.

Energy = intensity(Fletcher-Munson scale(watt)) * times (ms)

First, I will draw the Fletcher-Munson scale(watt) just because of transferring the linear level,
Second, I will use pitchworks experiment for measuring the energy level depending on syllable position, consonant place and manner.
Third, I will use SPSS 12.0 for descriptive statistics.
Fourth, I will convert the scores into the Energy Level.
This study will reveal why prosodic onset sound changes to the affricate in Old High German, post-v

한글키워드
음약화,음절 비대칭,파찰음화,마찰음화,조음음성학,통계언어학,고대고지독어,음변화,음향언어학,에너지레벨

결과보고시 연구요약문

국문
본 논문은 영어 파열음의 유/무성대조를 음향과 인지관계에서 살펴보기 위하여, 시구간신호(temporal cues)를 대상으로, 각 환경(CV, VCV, VC)에서 발화분석과 인지실험을 통하여 비교, 분석하였다. 그 결과 첫째, 음향 측면에서 유의미한 신호가 반드시 인지 측면에서 유 ...

본 논문은 영어 파열음의 유/무성대조를 음향과 인지관계에서 살펴보기 위하여, 시구간신호(temporal cues)를 대상으로, 각 환경(CV, VCV, VC)에서 발화분석과 인지실험을 통하여 비교, 분석하였다. 그 결과 첫째, 음향 측면에서 유의미한 신호가 반드시 인지 측면에서 유의미한 신호가 되는 것은 아니며, 둘째로 인지에서 유의미한 신호는 각 환경에 따라 유/무성대조에 미치는 영향이 다르고, 따라서 음향과 인지사이에는 비대칭이 존재한다는 사실을 밝혔다. 음향(acoustic)과 인지(perception)의 비대칭은 단순히 음향에서의 주요 신호가 인지에서의 주요 신호로 작용하지 못함을 의미한다. 즉, 인지 정도는 음향적인 신호로만 구성되는 것이 아니라, 심리음향학적(psychoacoustic) 요소가 결합되어 청각(audition)에 영향을 주며, 이 청각에서의 인지작용이 인지도에 영향을 준다.
청각에서의 인지작용은 시구간신호의 우열이 존재함을 의미한다. 예를 들면, 음향상 폐쇄구간과 VOT신호는 유/무성 구분에 있어서 같은 정도의 유의도를 지닌다. 그러나 인지측면에서 폐쇄구간은 미약한 신호의 질(quality)로 인하여 유/무성 구분정도가 떨어지는 반면에, VOT신호는 매우 우수한 신호가 된다. 따라서, 이러한 음향/인지비대칭은 영어파열음의 유/무성을 결정하는 신호의 서열가설을 정립하는 기반을 제공해 준다. 즉, 이 가설은 ‘개방단계신호(파열, VOT, 후행 모음구간) >> 접근단계신호(선행모음구간) >> 폐쇄단계신호(폐쇄묵음구간, 폐쇄중 유성구간)’서열로 구성된다고 주장한다.
이 가설을 입증하기 위하여, 발화분석과 인지실험을 하였다. 그 결과, VOT와 후행모음구간신호를 포함하는 개방단계신호가 가장 우수한 인지신호이며, 선행모음구간을 포함하는 접근단계신호가 그 뒤를 잇고, 폐쇄묵음구간과 폐쇄중 유성구간을 포함하는 폐쇄단계신호는 가장 빈약한 음향신호임이 밝혀졌다. 이러한 서열이 생기는 원인으로, 모음부분에 구현되는 스펙트럼신호(예: F1, F2수치 및 전이, 형태 등), 초분절 요소인 ‘음고저’(pitch)가 개방 및 접근단계신호에 구현됨으로서 유/무성을 구분하는 우수한 신호가 된다.

영문
The study tries to find out what acoustic and perceptual cues play the main role to draw the contrast of English plosives. The hypothesis is that the voiced-voiceless distinction in each environment is decided on the cues not by acoustic level, but ...

The study tries to find out what acoustic and perceptual cues play the main role to draw the contrast of English plosives. The hypothesis is that the voiced-voiceless distinction in each environment is decided on the cues not by acoustic level, but by perceptual level. For example, 'release' cue in VC environment is not the main cue in the acoustic level, but the main cue in the perceptual level. For proving it, the experiments are carried out in each environment(CV, VCV, VC) using temporal cues and the results are analyzed.
The results are like followings; 1) the significant cues in acoustics are not in parallel with those in perception, 2) those cues in perception have the different influence rate to the voiced-voiceless distinction, 3) naturally the asymmetry between two exists. The asymmetry between acoustics and perception means that the major cues in acoustics necessarily cannot be those in perception. In this respect, the perceptual scale may not depend on acoustic cues, but on psychoacoustic factors. The point is that perception process in auditory influences the perceptual scale.
The perception process means the hierarchy of the temporal cues. For the example, although the cues of closure duration and of VOT have the same significant rate in voiced-voiceless distinction, the influence power in the field of perception is different. That is, the closure duration is revealed as the weak cue because of the perceptual-low quality, but VOT as the superior cue. Naturally the asymmetry between two sets up the hypothesis of the cue hierarchy that shows "release cues(VOT, post-vowel) >> approach cues(pre-vowel) >> closure cues(closure duration, closure voicing)."
The perceptual experiment shows that the release cues including VOT and post-vowel are the most powerful, that the approach cues including pre-vowel follows, and that the closure cues are the weakest. As causes of the cue hierarchy, the spectra cues(e.g., F1, F2, values, transition, and shapes) are realized in the neighboring vowels, and pitch as one of suprasegmental factors is shown in the parts of approach and release stage.
The study finds out that voiced and voiceless sounds have the different perceptual sense. The voiceless sounds are sensitive to the aperiodic energy such as VOT or release and depends on the spectra cues realized in pre-vowel under the environment that the aperiodic energy is diminished. On the contrary, the voiced sounds are wholly dependent on the periodic shown in the neighboring vowels. The perception difference between voiced-voiceless sounds comes from the difference of auditory impact.

연구결과보고서

초록
영어 파열음에 대한 음성학측면의 연구들은, 유/무성인지는 VOT(Liberman et al. 1958; Lisker & Abramson 1964, 1967, 1971; Plauché 2001), 파열(Malécot 1958; Byrd 1993), 선행모음길이(Massaro & Cohen 1983), 폐쇄구간(Lisker 1957, 1986; Parker & ...

영어 파열음에 대한 음성학측면의 연구들은, 유/무성인지는 VOT(Liberman et al. 1958; Lisker & Abramson 1964, 1967, 1971; Plauché 2001), 파열(Malécot 1958; Byrd 1993), 선행모음길이(Massaro & Cohen 1983), 폐쇄구간(Lisker 1957, 1986; Parker & Kluender 1986), 자음/모음 구간비율(Massaro & Cohen 1983), F0(Kingston & Diehl 1994), F1(Liberman et al. 1958; Kingston & Diehl 1994), F2(Delattre et al. 1955)등의 음향신호에 의하여 결정된다고 보았다.
그러나 음운인지과정에서 이러한 음향신호들의 역할이 서로 다를 것이라는 연구는 일부 이루어져왔는데(Lisker & Abramson 1964, 1971; Repp 1979; Nearey 1997), 이 연구들은 주로 VOT가 범 언어적으로 유/무성을 구분하는 주요한 신호라는 것이다. VOT는 각 언어의 유/무성파열음의 음향측정을 통하여(영어, 스페인어, 태국어: Lisker & Abramson 1970; 한국어: Lisker & Abramson 1971; 레바논 아랍어: Yeni-komsian et al. 1977), 각 언어의 후두자질을 규명하는 중요한 신호로 인정(Iverson & Salmons 1995)될 뿐만 아니라, 구간길이 및 스펙트럼형태에 의하여 유/무성과 조음위치가 결정된다고 보았다(Plauché 2001).
이러한 VOT위주의 파열음 연구는, 과연 이 신호가 유/무성인지에 결정적인 신호인가에 대하여 회의적인 시각이 나타났다(Serniclaes & Bejster 1979). 이 대안으로서 다방면의 신호를 이용한 인지실험이 실시되었는데, 주로 자연음의 편집(Williams 1977), 인공합성음에 대한 여러 신호의 조작(Serniclaes & Bejster 1979) 을 통하여 이루어져왔다. 이러한 실험을 통하여 유/무성인지는 VOT에만 의존하고 있지 않음을 보여주었다.
본 연구에서는 VOT뿐만 아니라, 모든 시구간신호(temporal cues)를 대상으로 유/무성인지에 작용하는 신호의 서열을 규정하는 작업을 하고자 한다. 이는 기존의 인지연구가 VOT를 위주로 한 일부신호의 비교나(Serniclaes & Bejster 1979), 단일신호의 구간에 의한 분석(Lisker & Abramson 1964; Plauché 2001)에 그친다는 점을 반성하여 종합적으로 분석하려고 하였다. 이를 위하여, 주요한 시구간신호에 해당하는 6개의 신호를 선정하였다. 이는 선정된 신호를 대상으로, 음향과 인지 측면을 서로 비교함으로서, 두 측면사이의 신호역할에 대한 비대칭과 신호서열과의 관계를 규명하기 위해서다.

연구결과 및 활용방안
본 연구는 발화와 청각실험을 통하여 음향과 인지영역간의 비대칭을 연구하였다. 비록 음향상으로는 같은 정도의 통계적 유의도를 보일지라도, 인지측면에서는 많은 차이를 가져온다는 점을 보여주었다. 단일 신호측면에서는 VOT보다는 ‘선행모음’신호가 유/무성을 구분하 ...

본 연구는 발화와 청각실험을 통하여 음향과 인지영역간의 비대칭을 연구하였다. 비록 음향상으로는 같은 정도의 통계적 유의도를 보일지라도, 인지측면에서는 많은 차이를 가져온다는 점을 보여주었다. 단일 신호측면에서는 VOT보다는 ‘선행모음’신호가 유/무성을 구분하는 더 우월한 신호임이 입증되었다. 그러나 상위범주로 구분이 되었을 때는 VOT가 속한 ‘개방단계신호’가 더 우월하였다. 따라서, 본 논문의 가설인 신호서열이 입증되었다.

(1) 영어 파열음 신호 서열

개방단계신호(파열, VOT, 후행모음구간) >> 접근단계신호(선행모음구간) >> 폐쇄단계신호(폐쇄묵음구간, 폐쇄중 유성구간)

개방단계신호와 접근단계신호는 폐쇄단계신호에 비하여 신호일치도 검사와 강인도 검사에서 우수한 신호로 판명되었다. 이는 신호정보량이 신호간에 똑같지 않다는 의미이다. 이론적으로, Nearey(1997)의 약이론이 타당함도 입증되었다. 이는 시구간과 스펙트럼에 의하여 구현되는 음향단서와 음운구조사이에는 변화가 많고 환경적으로 민감한 관계이기 때문이며, 음향이 갖는 풍부한 정보와 가변적인 음성신호와 불변하는 음운구조를 자각하는 인지능력사이에는 비대칭이 존재하기 때문이다.
실험결과는 신호허가이론(Licensing by Cue: Steriade 1995, 1997, 2001)에 의문을 제기할 수 있는 근거를 제공한다. 이 이론에 의하면, 파열음의 대조/중화는 음절위치에 의해서가 아니라, 각 환경에 구현되는 신호정보량 때문에 일어난다고 보았다. 예를 들어, VCV 환경에서의 음대조가 VC 환경보다 활발한 것은 상대적으로 많은 음향신호가 출현하였고, 따라서 신호정보량이 많아지고, 바로 이런 이유로 음대조를 갖고 오게 되는 것이다.
본 논문에서는 음향과 인지, 발화와 청각간의 비대칭에 대하여 연구를 하였다. 그 결과 인지측면에서 신호간의 서열이 존재함을 증명하였으며, 이는 모국어를 기반으로 하는 청자가 갖는 청각적인 특성 때문임을 보여주었다. 결국 이러한 비대칭적인 요소는, 음향신호를 기반으로 하는 신호허가이론에 대하여 의문을 제기한다

색인어
파열음, 폐쇄음, 유∙무성성, 인지, 청각, 음향, 신호, 차폐, 신호허가이론, 비대칭성

이 연구과제의 신청시 심사신청분야(최대 3순위까지 신청 가능)
1순위 : 인문학 > 영어와문학 > 영어학 > 음성/음운론(영어학)
2순위 : 인문학 > 영어와문학 > 영어학 > 영어사

연구성과물 목록