A comprehensive workshop designed to teach developers how to leverage the Box AI API capabilities through hands-on Jupyter notebook exercises. This interactive learning experience covers the full spectrum of Box AI APIs, from basic document Q&A to advanced structured data extraction and custom AI agents.
This workshop provides practical, hands-on experience with:
- Document Intelligence: Ask questions and get answers from single documents or curated document collections
- Data Extraction: Extract structured and unstructured data from various document types (PDFs, Word docs, etc.)
- Box Hubs: Use Box's Retrieval Augmented Generation (RAG) capabilities across large document sets
- Custom AI Agents: Create and deploy specialized AI agents through Box AI Studio
- Enterprise Integration: Build production-ready workflows using the Box Python SDK
- Box Enterprise Account with the following enabled:
- Box AI APIs
- Box AI Studio
- Box Hubs
 
- Box Application configured with:
- Client Credentials authentication
- Manage AIscope enabled
- Application enabled in Box admin console
 
- Box User ID of the application creator
- Python 3.11+
- Jupyter Notebook or JupyterLab
- Ability to create Python virtual environments
- Basic familiarity with Python and REST APIs
Clone this repository and create a virtual environment:
git clone https://github.com/box-community/box-ai-api-masterclass.git
cd box-ai-api-master-class
# Create and activate virtual environment
python -m venv .venv
source .venv/bin/activate  # On Windows: .venv\Scripts\activatepip install -r .requirements.txtOr install manually:
pip install box-sdk-gen python-dotenv jupyterjupyter notebook
# or
jupyter labImportant: Start with 1-Setup.ipynb first. This notebook will:
- Guide you through entering your Box credentials
- Create all necessary Box objects (folders, files, hubs, metadata templates, AI agents)
- Generate a .envfile with all required configuration
- Upload sample documents to Box for the exercises
.env file for convenience. In production, use secure credential management solutions.
Prerequisites Setup & Box Object Creation
- Configure authentication with Box APIs
- Create folder structure and upload sample documents
- Set up Box Hubs, metadata templates, and AI agents
- Generate environment configuration for subsequent exercises
Single Document Question Answering
- Learn the /askendpoint for document Q&A
- Implement conversation history for contextual follow-ups
- Work with citations and document references
- Sample Document: US Parole Commission policy guidelines
Multi-Document RAG with Box Hubs
- Query across curated document collections
- Understand Box's managed RAG implementation
- Analyze complex document relationships
- Sample Documents: Clinical drug trial documentation
Unstructured Data Extraction
- Use the /extractendpoint with natural language prompts
- Extract key-value pairs without predefined schemas
- Handle various document formats and structures
- Sample Document: W-2 tax form
Metadata Template-Based Extraction
- Leverage Box Metadata Templates for consistent extraction
- Process multiple documents with identical schemas
- Understand enterprise data standardization
- Sample Documents: Invoice collection
Advanced AI Agents & Custom Models
- Use Box's Enhanced Extract Agent for improved accuracy
- Create and deploy custom Box AI Studio agents
- Compare default vs. specialized agent performance
- Sample Documents: Legal due diligence and purchase agreements
Each exercise generates standalone Python scripts that you can use as:
- Starting points for your own projects
- Reference implementations for production workflows
- CLI tools for testing and development
Generated files:
- box_ai_qna_single.py- Interactive single document chat
- box_ai_qna_hub.py- Interactive multi-document chat
- box_ai_flexible_extract.py- Flexible data extraction
- box_ai_structured_extract.py- Template-based extraction
- box_ai_enhanced_extract.py- Enhanced extraction agent
- box_ai_studio_agent.py- Custom AI Studio agent
The workshop includes carefully selected sample documents that demonstrate real-world use cases:
- Government Policy Documents (Exercise 2)
- Clinical Trial Data (Exercise 3)
- Tax Forms (Exercise 4)
- Business Invoices (Exercise 5)
- Legal Contracts (Exercise 6)
- Stateless Conversation Management: Maintain context across API calls
- Citation Tracking: Trace AI responses back to source documents
- Batch Processing: Handle multiple documents efficiently
- Custom Field Definitions: Define extraction schemas programmatically
- Agent Customization: Tailor AI behavior for specific use cases
- Error Handling: Robust error management for production environments
While this workshop uses simplified authentication and storage for learning purposes, consider these factors for production deployments:
- Security: Use secure credential management (AWS Secrets Manager, Azure Key Vault, etc.)
- Scalability: Implement proper async/await patterns for concurrent processing
- Monitoring: Add logging, metrics, and error tracking
- Rate Limiting: Implement appropriate throttling for API calls
- Data Privacy: Ensure compliance with your organization's data handling policies
- Box Developer Documentation: developer.box.com
- Box AI APIs: Box AI Documentation
- Box Python SDK: box-python-sdk
- Box Community: Box Developer Community
If you encounter issues:
- Check Prerequisites: Ensure all Box features are enabled in your account
- Verify Setup: Confirm the setup notebook completed successfully
- Review Logs: Check Jupyter output for detailed error messages
- Community Support: Post questions in the Box Developer Community
Ready to get started? Open 1-Setup.ipynb and begin your Box AI journey!