보고서 상세정보

https://www.krm.or.kr/krmts/link.html?dbGubun=SD&m201_id=10016594&local_id=10023438

https://www.krm.or.kr/krmts/link.html?dbGubun=SD&m201_id=10016594&local_id=10023438

뇌정보처리에 기반한 음성의 특징추출 및 특징선택 - 한국어와 일본어 음성에 내재된 감정인식을 중심으로

이 보고서는 한국연구재단(NRF, National Research Foundation of Korea)이 지원한 연구과제( 뇌정보처리에 기반한 음성의 특징추출 및 특징선택 - 한국어와 일본어 음성에 내재된 감정인식을 중심으로 | 2008 년 신청요강 다운로드

| 이수영(한국과학기술원)

) 연구결과물 로 제출된 자료입니다.

한국연구재단 인문사회연구지원사업을 통해 연구비를 지원받은 연구자는 연구기간 종료 후 6개월 이내에 결과보고서를 제출하여야 합니다.(*사업유형에 따라 결과보고서 제출 시기가 다를 수 있음.)

연구자가 한국연구재단 연구지원시스템에 직접 입력한 정보입니다.

연구과제번호	D00091
선정년도	2008 년
과제진행현황	종료
제출상태	재단승인
등록완료일	2010년 03월 02일
연차구분	결과보고
결과보고년도	2010년

결과보고시 연구요약문

국문
음성에 내재한 감정의 인식의 인식을 위하여 음성의 주 정보인 언어적 요소를 줄이고 미묘한 감정의 차이를 강조하는 특징추출 기법을 2가지 개발하였다. 먼저 음의 세기와 피치의 시간 변화에 따라 200가지의 통계적 특성을 추출하는 것을 기본으로 한다. 첫번째 알고리 ...

음성에 내재한 감정의 인식의 인식을 위하여 음성의 주 정보인 언어적 요소를 줄이고 미묘한 감정의 차이를 강조하는 특징추출 기법을 2가지 개발하였다. 먼저 음의 세기와 피치의 시간 변화에 따라 200가지의 통계적 특성을 추출하는 것을 기본으로 한다. 첫번째 알고리즘(표현적 NMF; representative-NMF; rNMF)은 이로부터 복잡한 음의 특징을 가장 잘 나타내는 NMF 특징을 선택한다. 이를 위해 먼저 자율학습기법에 의해 많은 수의 상호 독립인 특징을 추출하고, 이 중에서 인식에 기여도가 큰 소소의 특징을 선별하게 된다. 두번째 알고리즘(변별적 NMF; discriminative-NMF; dNMF)은 처음부터 인식에 도움이 되는 특징만을 추출한다. 즉, 기존의 NMF 학습법칙에 인식기여도를 나타내는 Fisher’s Linear Discriminant 함수를 최대화하는 조건을 부가적으로 추가하여 특징을 추출하게 된다. 음성으로부터 감정을 인식하는 보편적 데이터베이스(Berlin emotional database)에 대해서 두 가지 알고리즘 모두 20개의 특징 만으로도 기존의 특징추출 기법에 비해 우수한 인식성능을 입증하였다.

영문
For the emotion recognition in speech we had developed two feature extraction algorithms, which emphasize the subtle emotional differences while de-emphasizing the dominant linguistic components. The starting point is to extract 200 statistical featur ...

For the emotion recognition in speech we had developed two feature extraction algorithms, which emphasize the subtle emotional differences while de-emphasizing the dominant linguistic components. The starting point is to extract 200 statistical features based on intensity and pitch time series, which are considered as the superset of necessary emotional features. Then, the first algorithm, rNMF (representative Non-negative Matrix Factorization), selects simple features best representing the complex NMF-based features. It first extracts a large set of complex almost-mutually-independent features by unsupervised learning and latter selects a small number of simple features for the classification tasks. The second algorithm, dNMF (discriminant NMF), extracts only the discriminate features by adding Fisher criterion as an additional constraint on the cost function of the standard NMF algorithm. Both algorithms demonstrate much better recognition rates even with only 20 features for the popular Berlin database.

연구결과보고서

초록
음성에 내재한 감정의 인식의 인식을 위하여 음성의 주 정보인 언어적 요소를 줄이고 미묘한 감정의 차이를 강조하는 특징추출 기법을 2가지 개발하였다. 먼저 음의 세기와 피치의 시간 변화에 따라 200가지의 통계적 특성을 추출하는 것을 기본으로 한다. 첫번째 알고리 ...

음성에 내재한 감정의 인식의 인식을 위하여 음성의 주 정보인 언어적 요소를 줄이고 미묘한 감정의 차이를 강조하는 특징추출 기법을 2가지 개발하였다. 먼저 음의 세기와 피치의 시간 변화에 따라 200가지의 통계적 특성을 추출하는 것을 기본으로 한다. 첫번째 알고리즘(표현적 NMF; representative-NMF; rNMF)은 이로부터 복잡한 음의 특징을 가장 잘 나타내는 NMF 특징을 선택한다. 이를 위해 먼저 자율학습기법에 의해 많은 수의 상호 독립인 특징을 추출하고, 이 중에서 인식에 기여도가 큰 소소의 특징을 선별하게 된다. 두번째 알고리즘(변별적 NMF; discriminative-NMF; dNMF)은 처음부터 인식에 도움이 되는 특징만을 추출한다. 즉, 기존의 NMF 학습법칙에 인식기여도를 나타내는 Fisher’s Linear Discriminant 함수를 최대화하는 조건을 부가적으로 추가하여 특징을 추출하게 된다. 음성으로부터 감정을 인식하는 보편적 데이터베이스(Berlin emotional database)에 대해서 두 가지 알고리즘 모두 20개의 특징 만으로도 기존의 특징추출 기법에 비해 우수한 인식성능을 입증하였다.

For the emotion recognition in speech we had developed two feature extraction algorithms, which emphasize the subtle emotional differences while de-emphasizing the dominant linguistic components. The starting point is to extract 200 statistical features based on intensity and pitch time series, which are considered as the superset of necessary emotional features. Then, the first algorithm, rNMF (representative Non-negative Matrix Factorization), selects simple features best representing the complex NMF-based features. It first extracts a large set of complex almost-mutually-independent features by unsupervised learning and latter selects a small number of simple features for the classification tasks. The second algorithm, dNMF (discriminant NMF), extracts only the discriminate features by adding Fisher criterion as an additional constraint on the cost function of the standard NMF algorithm. Both algorithms demonstrate much better recognition rates even with only 20 features for the popular Berlin database.

연구결과 및 활용방안
학문적으로는 음성에 내재한 미묘한 감정의 특징을 효과적으로 추출하고 선택하는 기법은 다양한 패턴인식에 활용될 것이다. 음성과 영상을 복합적으로 사용하는 감정인식 기법으로 발전시킬 수 있다. 또한, 뇌파나 눈동자, 심전도 등 다양한 생체신호를 이용한 인간의 의 ...

학문적으로는 음성에 내재한 미묘한 감정의 특징을 효과적으로 추출하고 선택하는 기법은 다양한 패턴인식에 활용될 것이다. 음성과 영상을 복합적으로 사용하는 감정인식 기법으로 발전시킬 수 있다. 또한, 뇌파나 눈동자, 심전도 등 다양한 생체신호를 이용한 인간의 의도파악에서 미세한 차이를 강조하는 특징추출을 통해 인식성능을 높일 것이다. 산업적으로는 기계와 인간의 보다 긴밀한 관계를 발전시켜, 지능형 로봇과 음성산업의 발전을 위해 활용될 것이다.

For academic research the developed selection and extraction algorithms of delicate emotional features wiltl be applicable to many pattern recognition applications. It may be further extended to the hybrid audio-visual recognition of human emotions. Also, it will improve the recognition performance for human intention from diverse biosignals such as EEG, eye gaze, and ECG. For industries it will contribute to the intelligent robot and speech recognition industries by providing close relationship between human and machine.

색인어
변별적 특징, 특징추출, 특징선택, 비음수행렵분석, 피셔 선형변별력, discriminant feature, feature extraction, feature selection, NMF, Fisher's linear discriminant

이 연구과제의 신청시 심사신청분야(최대 3순위까지 신청 가능)
1순위 : 공학 > 전자/정보통신공학 > 신호처리 > 음성신호처리
2순위 : 공학 > 전자/정보통신공학 > 신경망 > 신경회로망

이 보고서에 대한 디지털 콘텐츠 목록

본 자료는 원작자를 표시해야 하며 영리목적의 저작물 이용을 허락하지 않습니다.
또한 저작물의 변경 또는 2차 저작을 허락하지 않습니다.