Capabilities

Kid Smart AI uses a range of open-source and proprietary technology to build cutting-edge AI solutions for industries that build products for children, such as education, toys and games, and content generation.

Capability	Description	Endpoint
Verbal fluency assessment	Assesses a child's oral reading fluency on a given text.	/v1/audio/fluency
Pronunciation assessment	Evaluates the accuracy of pronunciation for spoken language.	/v1/audio/pronunciation
Word recognition	Evaluates accuracy of spoken words/phrases (up to 30 seconds).	/v1/audio/recognition
Decodable fictional stories	Generates personalized fictional stories for readers.	/v1/text/decodable-stories
Image generation	Creates personalized and consistent character illustrations.	/v1/image/character-generation
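
As a minimal sketch of how the endpoint paths in the table above might be turned into full URLs: the base URL below is a placeholder (this page does not state the real host), and the dictionary keys and the `endpoint_url` helper are hypothetical names chosen for illustration. Only the paths themselves come from the table.

```python
from urllib.parse import urljoin

# Placeholder host: an assumption, not the real Kid Smart AI base URL.
BASE_URL = "https://api.example.com"

# Endpoint paths copied from the capability table; the keys are illustrative.
ENDPOINTS = {
    "fluency": "/v1/audio/fluency",
    "pronunciation": "/v1/audio/pronunciation",
    "recognition": "/v1/audio/recognition",
    "decodable-stories": "/v1/text/decodable-stories",
    "character-generation": "/v1/image/character-generation",
}


def endpoint_url(capability: str, base_url: str = BASE_URL) -> str:
    """Return the full URL for a capability key; raises KeyError if unknown."""
    return urljoin(base_url, ENDPOINTS[capability])


if __name__ == "__main__":
    for name in ENDPOINTS:
        print(name, endpoint_url(name))
```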

Verbal fluency assessment

The Verbal fluency assessment product provides fast, accurate analysis of a child's reading fluency. The student is recorded reading a passage; the recording and the ID of the text that was read are then passed to the Kid Smart AI API.

Inputs:

Audio file (mp3, mp4, etc., specified here)
ID of the text that was provided to the child
- The organization can supply its own texts or use any Kid Smart AI text

Outputs for Verbal Fluency Assessment

Output Description	Details
Words per minute (WPM)	Measures how many words the child reads per minute.
Accuracy of reading	Percentage of words read correctly (calculated as 100 * (Number of words correctly read / total number of words)).
Errors identified	Classifications include Pronunciation, Omission, Insertion, Self-correction, Repetition.
Timestamps of errors	(Optional) Provides the specific times at which errors occurred during the reading.

Analysis starts once the audio file is submitted and is typically complete within 30 seconds.
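
To make the two numeric outputs concrete, here is a minimal sketch of the arithmetic. The accuracy formula is the one given in the output table; the WPM calculation assumes the count is of all words read divided by the recording duration in minutes, which this page does not spell out. The function names are illustrative and are not part of the Kid Smart AI API.

```python
def words_per_minute(words_read: int, duration_seconds: float) -> float:
    """Words read divided by elapsed minutes (assumed definition of WPM)."""
    if duration_seconds <= 0:
        raise ValueError("duration_seconds must be positive")
    return words_read / (duration_seconds / 60.0)


def reading_accuracy(words_correct: int, total_words: int) -> float:
    """100 * (number of words correctly read / total number of words)."""
    if total_words <= 0:
        raise ValueError("total_words must be positive")
    return 100 * words_correct / total_words


if __name__ == "__main__":
    # A child reads a 50-word passage in 40 seconds with 45 words correct.
    print(words_per_minute(50, 40))
    print(reading_accuracy(45, 50))
```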

Best practices

In noisy environments (like a typical classroom), please use a headset for audio collection.
- If you cannot make out the words in an audio recording, neither can Kid Smart AI.
Allow the child the opportunity to self-correct.

Pronunciation Assessment

Inputs:

Audio file (wav only, specified here)
Expected phoneme or phonemes (in ARPAbet or in your own phonetic alphabet)
- Discuss your phoneme set with Kid Smart AI; we can implement it.
Model ID
- We can build custom model outputs for your use case, or you can select one of our options based on confidence (high vs. medium) and phoneme segmentation requirements (none vs. segmented).
- Please see the playground examples to compare the options.
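
The three inputs above can be gathered and checked before a request is sent. The sketch below is illustrative only: the field names (`audio_path`, `expected_phonemes`, `model_id`), the helper name, and the example model ID are assumptions, since this page does not document the request schema. Only the wav-only restriction and the endpoint path come from the text.

```python
from pathlib import Path


def build_pronunciation_request(
    audio_path: str, expected_phonemes: list[str], model_id: str
) -> dict:
    """Validate the inputs and return them as a dict (hypothetical field names)."""
    # The inputs list states that only wav audio is accepted.
    if Path(audio_path).suffix.lower() != ".wav":
        raise ValueError("pronunciation assessment accepts wav audio only")
    if not expected_phonemes:
        raise ValueError("at least one expected phoneme is required")
    return {
        "endpoint": "/v1/audio/pronunciation",  # path from the capability table
        "audio_path": audio_path,
        "expected_phonemes": list(expected_phonemes),
        "model_id": model_id,
    }


if __name__ == "__main__":
    # "cat" in ARPAbet is K AE T; the model ID here is a made-up placeholder.
    print(build_pronunciation_request("cat.wav", ["K", "AE", "T"], "example-model"))
```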

Outputs for Phoneme assessment

Output Description	Details
Is Correct?	Boolean indicating whether the phonemes were correctly pronounced
Analysis details	Timestamps and the predicted phonemes. If there is uncertainty in the phoneme prediction, multiple phonemes are given
Feedback	Feedback on any errors that were identified

The analysis occurs after the submission of the audio file, typically within 30 seconds of submission.

Decodable story generation (coming soon)

Inputs:

Select a scope and sequence (select from ours, or provide your own)
Specify the level on the provided scope and sequence
Specify topic of story
Specify length of story
Optional: Specify any skill that should be used more often (default is the specified level)
Optional: Specify additional words to use

Outputs: Decodable text of the specified length that uses only the provided vocabulary
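
One way to read the "only the vocabulary provided" constraint is as a check that every word in a story appears in an allowed word list. The sketch below illustrates that idea under simple assumptions (case-insensitive matching, words made of letters and apostrophes); it is not the Kid Smart AI generator or its validation logic, and the function name is hypothetical.

```python
import re


def out_of_vocabulary_words(text: str, vocabulary: set[str]) -> list[str]:
    """Return the distinct words in text that are not in vocabulary, in order."""
    allowed = {word.lower() for word in vocabulary}
    found = []
    for word in re.findall(r"[a-z']+", text.lower()):
        if word not in allowed and word not in found:
            found.append(word)
    return found


if __name__ == "__main__":
    vocab = {"the", "cat", "sat", "on", "mat"}
    # An empty list means the text uses only the provided vocabulary.
    print(out_of_vocabulary_words("The cat sat on the mat.", vocab))
    print(out_of_vocabulary_words("The cat sat on a rug.", vocab))
```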
