Interactive Command Builder¶
Build your OPAL commands with this interactive tool. Select your options and get the exact command to run.
Quick Examples Gallery¶
Click any example to load it into the builder:
๐ฐ Basic News Scraping
Scrape recent articles from 1819 News
python -m opal --url https://1819news.com/ --parser Parser1819 --suffix /news/item --max_pages 5
โ๏ธ Basic Court Cases
Extract court cases from Alabama Appeals
python -m opal --url https://publicportal.alappeals.gov/portal/search/case/results --parser ParserAppealsAL --max_pages 3
๐ Daily News Digest
Quick daily update from Alabama Daily News
python -m opal --url https://www.aldailynews.com/ --parser ParserDailyNews --max_pages 2
๐๏ธ Civil Court Search
Search civil court cases from last week
python -m opal.configurable_court_extractor --court civil --date-period 7d --exclude-closed
๐ Custom Court Search
Advanced court search with filters
python -m opal.configurable_court_extractor --court civil --date-period 1m --case-category Appeal --max-pages 10
๐ฌ Research Mode
Comprehensive data collection for analysis
python -m opal --url https://1819news.com/ --parser Parser1819 --max_pages 20
Interactive Command Builder¶
Build Your Command
Generated Command¶
Select options above to generate your command...
๐ก Pro Tip: After copying your command, open your terminal, activate your virtual environment with
source venv/bin/activate
(or venv\Scripts\activate
on Windows), then paste and run the command.
Command Validation¶
What Happens Next?¶
After running your command:
- For News Scraping: You'll get a JSON file with articles including title, author, date, and full content
- For Court Scraping: You'll get both JSON and CSV files with case details, plus progress updates in the terminal
- Files are named with timestamps for easy organization
- Check the terminal output for any warnings or helpful information
Advanced Options¶
Environment Variables¶
You can set these environment variables to customize behavior:
# Set default output directory
export OPAL_OUTPUT_DIR="/path/to/outputs"
# Set default rate limiting (seconds between requests)
export OPAL_RATE_LIMIT=2.0
# Enable debug mode
export OPAL_DEBUG=1
Combining with Other Tools¶
Process results with jq (JSON processor):
# Extract just article titles
python -m opal --url https://1819news.com/ --parser Parser1819 --max_pages 2 \
&& cat *Parser1819*.json | jq '.articles[].title'
# Count court cases by status
python -m opal.configurable_court_extractor --court civil --date-period 7d \
&& cat *civil*.json | jq '.cases | group_by(.status) | map({status: .[0].status, count: length})'
Save to specific directory:
mkdir -p ~/opal-results/$(date +%Y-%m)
cd ~/opal-results/$(date +%Y-%m)
python -m opal --url https://1819news.com/ --parser Parser1819 --max_pages 5
Need Help?¶
- Command not working? Check the Understanding Errors guide
- Want to see examples? Visit Common Use Cases
- Need setup help? Try the Prerequisites Checker