文本编辑器,可以将语音转换为文本,反之亦然
Text editor which can convert speech to text and vice-versa
我正在考虑为我的学术项目实现一个文本编辑器,它可以将语音转换为文本,也可以朗读书面文本。
是否可以在Python中编码?或者有可能吗?如果可以,怎么做?
感谢任何帮助。
是的,很有可能。如果您是初学者,我建议您使用 python 来执行此操作。您可以将 PyQt 用于 GUI,pyttsx and SpeechRecognition 用于语音引擎(离线)。执行以下操作来安装它们:
pip install SpeechRecognition
pip install pyttsx
这里有一些代码可以帮助您开始 python
中的语音识别
import speech_recognition
import pyttsx
speech_engine = pyttsx.init('sapi5') # see http://pyttsx.readthedocs.org/en/latest/engine.html#pyttsx.init
speech_engine.setProperty('rate', 150)
def speak(text):
speech_engine.say(text)
speech_engine.runAndWait()
recognizer = speech_recognition.Recognizer()
def listen():
with speech_recognition.Microphone() as source:
recognizer.adjust_for_ambient_noise(source)
audio = recognizer.listen(source)
try:
return recognizer.recognize_sphinx(audio)
# or: return recognizer.recognize_google(audio)
except speech_recognition.UnknownValueError:
print("Could not understand audio")
except speech_recognition.RequestError as e:
print("Recog Error; {0}".format(e))
return ""
speak("Say something!")
speak("I heard you say " + listen())
我正在考虑为我的学术项目实现一个文本编辑器,它可以将语音转换为文本,也可以朗读书面文本。
是否可以在Python中编码?或者有可能吗?如果可以,怎么做?
感谢任何帮助。
是的,很有可能。如果您是初学者,我建议您使用 python 来执行此操作。您可以将 PyQt 用于 GUI,pyttsx and SpeechRecognition 用于语音引擎(离线)。执行以下操作来安装它们:
pip install SpeechRecognition
pip install pyttsx
这里有一些代码可以帮助您开始 python
中的语音识别import speech_recognition
import pyttsx
speech_engine = pyttsx.init('sapi5') # see http://pyttsx.readthedocs.org/en/latest/engine.html#pyttsx.init
speech_engine.setProperty('rate', 150)
def speak(text):
speech_engine.say(text)
speech_engine.runAndWait()
recognizer = speech_recognition.Recognizer()
def listen():
with speech_recognition.Microphone() as source:
recognizer.adjust_for_ambient_noise(source)
audio = recognizer.listen(source)
try:
return recognizer.recognize_sphinx(audio)
# or: return recognizer.recognize_google(audio)
except speech_recognition.UnknownValueError:
print("Could not understand audio")
except speech_recognition.RequestError as e:
print("Recog Error; {0}".format(e))
return ""
speak("Say something!")
speak("I heard you say " + listen())