Skip to contents
climasus4r 0.0.0.9000 (development version)
Phase 1: Data Infrastructure (π vs β
)
New Features
-
Data Import
- π
sus_data_import(): Import DATASUS data with parallel processing support
- Automatic caching to avoid redundant downloads
- Support for multiple states and years in a single call
- Progress tracking with informative CLI messages
- Error handling with detailed reporting
-
Data Cleaning
- π
sus_data_clean_encoding(): Automatic detection and correction of character encoding issues
- Handles common Latin1/UTF-8 conflicts in Brazilian Portuguese text
- Reports which columns were corrected
-
Data Standardization
- π
sus_data_standardize(): Comprehensive column name and value standardization
- Translates 80+ Portuguese column names to English
- Standardizes categorical values (sex, race, marital status, etc.)
- Option to keep original columns for comparison
-
Disease Filtering
- π
sus_data_filter_cid(): Flexible ICD-10 code filtering
- Support for single codes, code ranges, and entire chapters
- Multiple matching strategies (exact, starts_with, range)
- Auto-detection of ICD column
Roadmap
Phase 2: Socioeconomic Integration (Planned)
- Geographic boundary linking
- IBGE socioeconomic data integration
- Population-weighted spatial operations
Phase 3: Environmental Integration (Planned)
- INMET meteorological data import
- Air quality data integration
- Exposure matching algorithms
Phase 4: Spatial Analysis (Planned)
- Bayesian spatial smoothing
- Spatial cluster detection
- Local indicators of spatial association
Phase 5: Temporal & Predictive Analysis (Planned)
- Distributed lag non-linear models (DLNM)
- Attributable fraction calculation
- Machine learning prediction wrappers