Voice Recognition and voice recording on the Stroop Task

Author	Message
jgranadossamayoa (Forum Members) posted 9 Years Ago
Is there a way for Inquisit to recognize verbal responses and simultaneously save an audio file containing those responses? I know that the library has scripts that can do one or the other, but I would like to combine both in one script. Any input would be appreciated.
Dave (Administrators) posted 9 Years Ago ANSWER
No, Inquisit can't do both at the same time. Your options are to either (1) have Inquisit do the voice recognition while a separate recording program runs in the background and records the entire session to an audio file, or (2) have Inquisit record the audio and run recognition over the recorded audio after data collection.
Edited 9 Years Ago by Dave
jgranadossamayoa (Forum Members) posted 9 Years Ago
Thank you very much.
seandr (Administrators) posted 8 Years Ago
Just a quick follow-up: Inquisit 5 introduces a new soundcapture command that allows you to record audio for the entire duration of a trial, block, or experiment. For example, you can record a participant speaking for a fixed duration, or until the participant clicks a continue button. Or you could record audio for the entire length of an experiment. This feature is currently supported in Inquisit 5 Lab only. Support for uploading these and other media data with Inquisit Web will be added in a future release.
Thanks, Sean
Edited 8 Years Ago by seandr
luis (Forum Members) posted 7 Years Ago
Replying to seandr (Monday, October 10, 2016):
Why do you need to record responses? I am running Stroop tasks using speech recognition, and I've found that the engine is perfectly able to recognize and encode the responses, compute RTs, and give appropriate feedback. Incidentally, I am curious whether the precision of these voice-relayed RTs is comparable to that of RTs collected on a keyboard. Does anyone know?
Thanks, Luis
Dave (Administrators) posted 7 Years Ago
Replying to luis (Friday, October 20, 2017):
Computers are generally imperfect recognizers of human speech. Voice recognition works very well under well-controlled conditions, but for anything else having recordings may be preferable, which is why the option to record exists. A few examples:
- Tasks with only a limited number of response options (like a typical, simple Stroop paradigm) are more amenable to voice recognition than tasks involving a larger number of, e.g., similar-sounding response options ("rat" and "red" are easier to confuse for a computer than for a human listener).
- Voice recognition engines are not available for all languages, and recognition capabilities for some languages are more mature than for others.
- Environmental and speaker factors can adversely affect recognition accuracy and latency determination (noisy surroundings, bad microphone, bad microphone placement, speaking in a low or hushed voice, coughing).
- Voice recognition engines adapt to a given speaker over time, i.e. the more the engine has been trained to an individual speaker, the better the results will be. The less it has been trained, the likelier mistakes in automatic recognition are.
In short, under some circumstances, recording the audio and then determining responses and (re-)measuring latency after the fact from the recordings may be preferable. Hope this clarifies.
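As an illustration of (re-)measuring latency from a recording after the fact, here is a minimal Python sketch of a threshold-based voice onset detector. It is not part of Inquisit and not something described in this thread; the function name `voice_onset_ms` and its default parameters (`threshold`, `window_ms`, `min_windows`) are assumptions chosen for the example. It expects audio samples already decoded to floats in the range -1.0 to 1.0, with the recording starting at stimulus onset.

```python
import math


def voice_onset_ms(samples, sample_rate, threshold=0.1, window_ms=10, min_windows=3):
    """Estimate voice onset in ms from the start of the recording.

    The signal is cut into consecutive windows of `window_ms` milliseconds.
    Onset is the start of the first run of `min_windows` consecutive windows
    whose RMS amplitude is at or above `threshold`. Returns None if no such
    run exists. All default values are illustrative assumptions.
    """
    window = max(1, int(sample_rate * window_ms / 1000))
    run_start = None
    run_length = 0
    for start in range(0, len(samples) - window + 1, window):
        chunk = samples[start:start + window]
        rms = math.sqrt(sum(x * x for x in chunk) / window)
        if rms >= threshold:
            if run_length == 0:
                run_start = start  # remember where the loud run began
            run_length += 1
            if run_length >= min_windows:
                return 1000.0 * run_start / sample_rate
        else:
            run_length = 0  # a quiet window breaks the run (e.g. a brief click)
    return None


# Synthetic check: 500 ms of silence followed by 200 ms of a 200 Hz tone.
rate = 8000
silence = [0.0] * (rate // 2)
tone = [0.5 * math.sin(2 * math.pi * 200 * n / rate) for n in range(rate // 5)]
print(voice_onset_ms(silence + tone, rate))
```

Requiring several consecutive loud windows is one simple way to ignore short noises such as clicks; real recordings would likely need the threshold tuned to the microphone and room, and a human check of borderline trials.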
luis (Forum Members) posted 7 Years Ago ANSWER
Replying to Dave (Friday, October 20, 2017):
Thank you, Dave. I just wanted to use this thread to ask whether you have information about the timing accuracy of the voice-relayed RTs. In some result files, the RTs look fine over the very first trials, but after a few trials they take on what seems to me a suspicious regularity (for instance, all RTs ending in 0, like 540; 630; 440; 490; ...). I guess this must reflect a problem in how the system detects latency, but I wonder if you have any idea why this occurs, why it only occurs "sometimes", and whether it can be remedied.
Many thanks, Luis
Dave (Administrators) posted 7 Years Ago
Replying to luis (Saturday, October 21, 2017):
The engine will adapt to the current speaker over the course of the experiment and generally get better at recognizing her/his responses; that may account for greater variability over the first few trials. This, however, should not lead to a suspiciously regular pattern in later trials. I'm afraid I have no immediate idea where that comes from.