AWS Certified AI Practitioner(34) - Mechanical Turk, Augmented AI, Transcribe Medical, Comprehend Medical
Amazon Mechanical Turk (MTurk)
What it is
- Amazon Mechanical Turk (MTurk) is a crowdsourcing marketplace for simple human tasks (HITs, or Human Intelligence Tasks).
- It provides a distributed virtual workforceโhumans across the globe who can complete small tasks for pay.
- The name comes from an 18th-century chess-playing automaton called โThe Mechanical Turkโ, which was actually secretly operated by a human.
Example
- You have 10 million images and need labels.
- Instead of hiring a full-time workforce, you post this job on MTurk.
- Workers tag the images, and you set a reward (e.g., $0.10 per image โ $1M total cost).
Use cases
- Image classification
- Data collection
- Simple business processes (like surveys, data entry, content review)
Integration
- Works with Amazon A2I (Augmented AI) and SageMaker Ground Truth for large-scale data labeling for ML models.
๐ก Exam Tip:
When you see โlarge dataset labeling with human workforceโ โ MTurk. Often paired with A2I or Ground Truth in exam scenarios.
Amazon Augmented AI (A2I)
What it is
- Amazon A2I adds human oversight (Human-in-the-Loop, HITL) to ML predictions.
- Even the best ML models can be uncertain. A2I lets you send low-confidence predictions to humans for review.
- Humans can be:
- Your own employees
- AWSโs network of 500,000+ contractors
- MTurk workers
- Pre-screened third-party vendors (via AWS Marketplace)
How it works
- Input โ ML model makes a prediction.
- If confidence is high โ result is returned immediately.
- If confidence is low โ sent to A2I for human review.
- Reviewed outputs are stored (e.g., in S3) and can be fed back into training to improve the model.
Use cases
- Content moderation (Rekognition) โ humans review flagged images/videos.
- Text extraction (Textract) โ humans validate extracted key-value pairs.
- Custom ML โ define your own review workflow.
๐ก Exam Tip:
Key phrase โHuman-in-the-Loopโ = A2I. Often combined with Rekognition or Textract.
Amazon Transcribe Medical
What it is
- A version of Amazon Transcribe built specifically for the medical domain.
- HIPAA compliant, so safe for healthcare environments.
- Recognizes medical terminology: drug names, conditions, procedures, diseases.
Capabilities
- Real-time transcription (via microphone input).
- Batch transcription (upload audio files).
Use cases
- Doctors dictating patient notes directly into medical systems.
- Transcribing drug safety calls and adverse event reports.
๐ก Exam Tip:
If the scenario involves medical speech-to-text with HIPAA compliance โ Transcribe Medical.
Amazon Comprehend Medical
What it is
- A specialized version of Amazon Comprehend for healthcare data.
- Extracts structured information from unstructured clinical text
like:- Doctorโs notes
- Discharge summaries
- Lab/test results
- Case notes
Features
- Entity detection: meds, dosage, frequency, conditions, symptoms.
- Relationship extraction: links between medications, dosages, and conditions.
- PHI detection (DetectPHI API): identifies sensitive patient info for compliance.
Integration
- Documents stored in Amazon S3.
- Real-time data via Kinesis Data Firehose.
- Often used together with Transcribe Medical:
Speech โ Text (Transcribe) โ Text Analysis (Comprehend Medical).
๐ก Exam Tip:
When you see clinical text, PHI detection, or structured medical insights โ Comprehend Medical. Frequently paired with Transcribe Medical in exam questions.
Summary (for AWS Exam)
- MTurk: Crowdsourced workforce for labeling and simple tasks.
- A2I: Human review for ML predictions (Human-in-the-Loop).
- Transcribe Medical: Medical speech-to-text, HIPAA compliant.
- Comprehend Medical: Extract medical insights & PHI detection from text.
All articles on this blog are licensed under CC BY-NC-SA 4.0 unless otherwise stated.