Journal article Open Access

A Brief Survey on Emotion Based Text to Speech Conversion System

Supriya Dhanaraj Dhumale; Manjiri Vitthal Khopade; Bhushan Dhimate; Avadhoot Yogesh Dhere


DataCite XML Export

<?xml version='1.0' encoding='utf-8'?>
<resource xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xmlns="http://datacite.org/schema/kernel-4" xsi:schemaLocation="http://datacite.org/schema/kernel-4 http://schema.datacite.org/meta/kernel-4.1/metadata.xsd">
  <identifier identifierType="URL">https://zenodo.org/record/5514958</identifier>
  <creators>
    <creator>
      <creatorName>Supriya Dhanaraj Dhumale</creatorName>
      <affiliation>Department of Computer Science, Savitribai Phule Pune University, Pune (Maharashtra), India.</affiliation>
    </creator>
    <creator>
      <creatorName>Manjiri Vitthal Khopade</creatorName>
      <affiliation>Department of Computer Science, Savitribai Phule Pune University, Pune (Maharashtra), India</affiliation>
    </creator>
    <creator>
      <creatorName>Bhushan Dhimate</creatorName>
      <affiliation>Department of Computer Science, Savitribai Phule Pune University, Pune (Maharashtra), India.</affiliation>
    </creator>
    <creator>
      <creatorName>Avadhoot Yogesh Dhere</creatorName>
      <affiliation>Department of Computer Science, Savitribai Phule Pune University, Pune (Maharashtra), India.</affiliation>
    </creator>
  </creators>
  <titles>
    <title>A Brief Survey on Emotion Based Text to Speech Conversion System</title>
  </titles>
  <publisher>Zenodo</publisher>
  <publicationYear>2021</publicationYear>
  <subjects>
    <subject>Emotion recognition, Text to Speech, GRU.</subject>
    <subject subjectScheme="issn">2231-2307</subject>
    <subject subjectScheme="handle">100.1/ijsce.A35290911121</subject>
  </subjects>
  <dates>
    <date dateType="Issued">2021-09-30</date>
  </dates>
  <language>en</language>
  <resourceType resourceTypeGeneral="JournalArticle"/>
  <alternateIdentifiers>
    <alternateIdentifier alternateIdentifierType="url">https://zenodo.org/record/5514958</alternateIdentifier>
  </alternateIdentifiers>
  <relatedIdentifiers>
    <relatedIdentifier relatedIdentifierType="ISSN" relationType="IsCitedBy" resourceTypeGeneral="JournalArticle">2231-2307</relatedIdentifier>
    <relatedIdentifier relatedIdentifierType="DOI" relationType="IsIdenticalTo">10.35940/ijsce.A3529.0911121</relatedIdentifier>
  </relatedIdentifiers>
  <rightsList>
    <rights rightsURI="https://creativecommons.org/licenses/by/4.0/legalcode">Creative Commons Attribution 4.0 International</rights>
    <rights rightsURI="info:eu-repo/semantics/openAccess">Open Access</rights>
  </rightsList>
  <descriptions>
    <description descriptionType="Abstract">&lt;p&gt;Text to speech conversion is one of the applications of machine learning. It is widely used in search engines, standalone applications, web applications, chatbots and android applications. But still there is need to upgrade text to speech system so that we can get more interactive and user-friendly application. Traditional text to speech application has monotonous voice as output which does not has emotions in it and seems to be more mechanized. So, there is need to improvise the existing system by embedding the flavour of emotions in it. Existing text to speech cannot be used in story telling applications also it does not provide effective communication. Most of the Text to Speech systems are developed using algorithms such as Support Vector Machine (SVM), Na&amp;iuml;ve Bayes etc. Emotion Based Text to Speech System will help to improvise the existing Text to Speech system. With the help of machine learning and deep learning algorithm such as Recurrent Neural Network can be used for performing sentiment analysis and semantic analysis on the input text. We are going to use neural network which is more effective and help to maintain a relation between previous word and next word. Emotion based text to speech system will be able to identify four emotions &amp;lsquo;happy&amp;rsquo;, &amp;lsquo;sad&amp;rsquo;, &amp;lsquo;angry&amp;rsquo; and &amp;lsquo;neutral&amp;rsquo;. Emotion based text to speech system will be beneficial for educational purpose like listening stories from storytelling applications for young budding children. Emotion based text to speech is going to be serviceable for visually impaired individuals.&lt;/p&gt;</description>
  </descriptions>
</resource>
19
18
views
downloads
Views 19
Downloads 18
Data volume 5.4 MB
Unique views 19
Unique downloads 18

Share

Cite as