Word Boosting
Word Boosting (also known as Keyword/Keyterm Prompting) allows you to improve Keyword Recall Rate (KRR) for important keywords or phrases by up to 90%. This feature helps ensure accurate transcription of industry-specific terminology, product names, company names, and specialized jargon.
Enabling Word Boosting
Add keywords parameter as an array of strings in the format word:weight where weight is a positive number indicating boost intensity.
Word Boosting is currently only available for the Real-Time WebSocket API.
Real-Time WebSocket API
Case Sensitivity and Formatting
Keywords preserve formatting (including case and punctuation) which helps control how proper nouns, product names, or company names are transcribed.
Best practices for keyword formatting:
- For proper nouns (names, brands, titles): Use appropriate capitalization (
Waves,iPhone,Dr. Smith) - For non-proper nouns: Use lowercase (
tretinoin,algorithm,protocol)
When smart formatting is applied to the transcript, words that start sentences may be automatically capitalized regardless of keyword formatting.
Expected Improvements
Word Boosting can significantly improve recognition accuracy and confidence scores for industry-specific terminology. Typical improvements include:
Best Practices for Keyword Selection
Industry-specific terminology
Medical terms (tretinoin:5.0, diagnosis:4.0), technical jargon (escalation:4.5, API:3.5)
Product and company names
Brand names (Waves:5.0, iPhone:5.0), service names, competitor names
Multi-word phrases
Common phrases in your domain (account number:4.0, customer service:3.5)
Proper nouns
Names, brands, titles with appropriate capitalization (Dr. Smith:5.0)
Common non-proper nouns
Use lowercase (algorithm:3.0, protocol:3.0, refill:4.0)
What to Avoid
Generic common words
Very common words that are rarely misrecognized (the, and, is) - don’t boost these
Overly broad terms
Words that appear in many contexts without specific meaning
Excessive keywords
Focus on the most important 20-50 terms with appropriate weight values
Inconsistent formatting
Ensure capitalization matches your desired output
Extreme values
Avoid using only very high weights (10+) or very low weights (1) - use a range (1-10) to differentiate importance levels

