Speech Emotion Recognition (SER) - lesson 2 - Dataset

Speech Emotion Recognition (SER) is the task of recognizing the emotional aspects of speech irrespective of the semantic contents. While humans can efficiently perform this task as a natural part of speech communication, the ability to conduct it automatically using programmable devices is still an ongoing subject of research.

Speech is the most natural way of expressing ourselves as humans. It is only natural then to extend this communication medium to computer applications. We define speech emotion recognition (SER) systems as a collection of methodologies that process and classify speech signals to detect the embedded emotions.

Deep learning is a machine learning technique that teaches computers to do what comes naturally to humans: learn by example. Deep learning is a key technology behind driverless cars, enabling them to recognize a stop sign, or to distinguish a pedestrian from a lamppost.

Artificial intelligence is the simulation of human intelligence processes by machines, especially computer systems. Specific applications of AI include expert systems, natural language processing, speech recognition and machine vision.

Speech Emotion Recognition is the act of attempting to recognize human emotion and affective states from speech. This is capitalizing on the fact that voice often reflects underlying emotion through tone and pitch.

Download the Dataset:
https://drive.google.com/file/d/1wWsrN2Ep7x6lWqOXfr4rpKGYrJhWc8z7/view

التعرف على العاطفة و المشاعر في الكلام - الدرس 2 - قاعدة المعطيات

التعرف على المشاعر في الكلام هو مهمة التعرف على الجوانب العاطفية للكلام بغض النظر عن المحتويات الدلالية. في حين أن البشر يمكنهم أداء هذه المهمة بكفاءة كجزء طبيعي من الاتصال الكلامي ، فإن القدرة على إجرائها تلقائيًا باستخدام أجهزة قابلة للبرمجة لا تزال موضوع بحث مستمر.

الكلام هو الطريقة الطبيعية للتعبير عن أنفسنا كبشر. من الطبيعي عندئذٍ توسيع وسيلة الاتصال هذه لتشمل تطبيقات الكمبيوتر. نحدد أنظمة التعرف على المشاعر الكلامية على أنها مجموعة من المنهجيات التي تعالج وتصنف إشارات الكلام لاكتشاف المشاعر المضمنة.

التعلم العميق هو أسلوب تعلم آلي يعلم أجهزة الكمبيوتر أن تفعل ما هو طبيعي للبشر: التعلم بالقدوة. التعلم العميق هو تقنية أساسية وراء السيارات ذاتية القيادة ، مما يمكّنها من التعرف على علامة التوقف ، أو تمييز المشاة عن عمود الإنارة.

الذكاء الاصطناعي هو محاكاة عمليات الذكاء البشري بواسطة الآلات ، وخاصة أنظمة الكمبيوتر. تشمل التطبيقات المحددة للذكاء الاصطناعي الأنظمة الخبيرة ومعالجة اللغة الطبيعية والتعرف على الكلام ورؤية الآلة.

التعرف على المشاعر في الكلام هو محاولة التعرف على المشاعر الإنسانية والحالات العاطفية من الكلام. هذا هو الاستفادة من حقيقة أن الصوت غالبًا ما يعكس العاطفة الأساسية من خلال النغمة والنبرة.

تحميل قاعدة المعطيات المستخدمة :
https://drive.google.com/file/d/1wWsrN2Ep7x6lWqOXfr4rpKGYrJhWc8z7/view