Datasets
Looking for high-quality, annotated data for your machine learning applications? Explore our collection of off-the-shelf speech, image, and video data sets below. Most data sets have a downloadable sample file to give you a preview of the capabilities of our ready-to-order or highly customizable AI Data Services.

Speech Data
This data set contains recordings of call center conversations in Japanese (ja_JP).

Speech Data
This data set contains up to 1,000 hours of recorded call center conversations in US English (en_US).

Speech Data
Google wake words in US English (en_US) from 103 participants aged 19-68.

Speech Data
Siri wake words and voice commands in US English (en_US) from 103 participants aged 19-68.

Speech Data
500 hours of phone conversations in Japanese (ja_JP).

Speech Data
500 hours of phone conversations in Irish English (en_IE).

Image Data
347,820 eye gaze images covering 62 people, 187 eye gaze directions, and 3 head poses.

Video Data
Four cameras recorded traffic (cars and pedestrians) at an intersection, each from either a 45-degree or a 90-degree angle.

Speech Data
US English wake words using "Siri" from 103 participants aged 19-68.

Speech Data
US English voice commands including the wake word "OK Google" from 103 participants aged 19-68.

Speech Data
50 hours of phone conversations in Dutch (nl_NL).

Speech Data
Wake word "Alexa" in US English (en_US) from 103 participants aged 19-68.