Spaces:

InstaDeepAI
/

sentinel

Runtime error

App Files Files Community

sentinel / AGENTS.md

jeuko

Sync from GitHub (main)

c403899 verified about 2 months ago

preview code

raw

history blame contribute delete

15.8 kB

	# Repository Guidelines

	This repository contains the LLM-based Cancer Risk Assessment Assistant.

	## Core Technologies
	- FastAPI for the web framework
	- LangChain for LLM orchestration
	- uv for environment and dependency management
	- hydra: for configuration management

	## Development Setup

	### Environment Setup
	- Create the virtual environment (at '.venv') with `uv sync`.
	- As the repository uses uv, the uv should be used to run all commands, e.g., "uv run python ..." NOT "python ...".

	### Running Commands
	- Streamlit Interface: `uv run streamlit run apps/streamlit_ui/main.py`
	- CLI Demo: `uv run python apps/cli/main.py`
	- Tests: `uv run pytest`

	## Coding Standards

	### Coding Philosophy
	- Write simple, explicit, modular code
	- Prioritize clarity over cleverness
	- Prefer small pure functions over large ones
	- Return early instead of nesting deeply
	- Favor functions over classes unless essential
	- Favor simple replication over heavy abstraction
	- Keep comments short and only where code isn't self-explanatory
	- Avoid premature optimization or over-engineering

	### Variable Naming
	- Avoid single-letter variable names (x, y, i, j, e, t, f, m, c, ct) in favor of descriptive names.
	- Avoid abbreviations (fh, ct, w, h) in favor of full descriptive names.
	- Use context-specific names for loop indices based on what you're iterating over:
	- `item_index` for general enumeration
	- `line_index` for text line iteration
	- `column_index` for table/array column iteration
	- `row_index` for table/array row iteration
	- Use descriptive names for comprehensions and iterations:
	- `item` instead of `i` for general items
	- `element` instead of `e` for list elements
	- `key` instead of `k` for dictionary keys
	- `value` instead of `v` for dictionary values
	- Use descriptive names for coordinates and positions:
	- `x_position`, `y_position` instead of `x`, `y`
	- `width`, `height` instead of `w`, `h`
	- Use descriptive names for data structures:
	- `file_path` instead of `f` for file paths
	- `model` instead of `m` for model instances
	- `user` instead of `u` for user objects

	Examples from recent refactoring:
	- `for i, ref in enumerate(references)` → `for ref_index, ref in enumerate(references)`
	- `for e in examples` → `for example in examples`
	- `for m in models` → `for model in models`
	- `x = pdf.get_x()` → `x_position = pdf.get_x()`
	- `fh = family_history` → `family_history = family_history` (avoid abbreviations)
	- `ct for ct in cancer_types` → `cancer_type for cancer_type in cancer_types`
	- `f in MODELS_DIR.glob` → `file_path in MODELS_DIR.glob`
	- `t in field_type.__args__` → `type_arg in field_type.__args__`

	### Path Handling
	- Always use `pathlib.Path` for all file I/O, joining, and globbing
	- Accept `Path \| str` at function boundaries; normalize to `Path` internally
	- Never use `os.path` for path operations

	Example:
	```python
	from pathlib import Path

	def read_text(file: Path \| str) -> str:
	path = Path(file)
	return path.read_text(encoding="utf-8")
	```

	### Type Hints and Modern Python
	- Use modern type hints: `list`, `dict`, `tuple`, `set` (not `List`, `Dict`, etc.)
	- Use PEP 604 unions: `A \| B` (not `Union[A, B]` or `Optional[A]`)
	- Import from `typing` only when necessary (`TypedDict`, `Literal`, `Annotated`, etc.)
	- Never use `from __future__ import annotations`
	- Add type hints to all public functions and methods
	- Prefer precise types (`float`, `Path`, etc.) over generic ones
	- If `Any` is required, isolate and document why

	### Import Management
	- Place all imports at the top of the file, never inside functions or classes
	- Group imports in three sections with blank lines between:
	1. Standard library imports
	2. Third-party library imports
	3. Local/project imports
	- This improves performance (imports loaded once) and code readability

	### Error Handling and Logging
	- Use `try/except` only for I/O or external APIs
	- Catch specific exceptions only (never broad `except:`)
	- Raise clear, actionable error messages
	- Use `loguru` for logging, never `print()` in production code

	Example:
	```python
	from loguru import logger

	try:
	data = Path(file_path).read_text(encoding="utf-8")
	except FileNotFoundError as error:
	logger.error(f"Configuration file not found: {file_path}")
	raise ValueError(f"Missing required config: {file_path}") from error
	```

	### Docstring Standards
	- Use Google-style docstrings for all public functions and classes
	- Do NOT include type hints in docstrings (they're in the signature)
	- Describe behavior, invariants, side effects, and edge cases
	- Include examples for complex functions
	- Avoid verbose docstrings for simple, self-explanatory functions

	## Testing

	### Testing Philosophy
	- Write meaningful tests that verify core functionality and prevent regressions
	- Use `pytest` as the testing framework
	- Tests go under `tests/` mirroring the source layout
	- Test both valid and invalid input scenarios

	### Test Types
	- Unit tests: Small, deterministic, one concept per test
	- Integration tests: Real workflows or reference comparisons with external systems
	- Use `pytest.mark` to tag slow or manual tests

	### Test Coverage Requirements
	- Ensure comprehensive test coverage for all risk models
	- Ground Truth Validation: Test against known reference values
	- Input Validation: Test that invalid inputs raise `ValueError`
	- Edge Cases: Test boundary conditions
	- Inapplicable Cases: Test when models should return "N/A"

	### Running Tests
	```bash
	uv run pytest # Run all tests
	uv run pytest -q # Quiet mode
	uv run pytest -v # Verbose mode
	uv run pytest tests/test_risk_models/ # Specific directory
	```

	### Pre-Submission Checklist
	Before committing code, verify:
	1. ✅ Run `uv run pytest -q` (all tests pass)
	2. ✅ Run `pre-commit run --all-files` (all hooks pass)
	3. ✅ No `print()` statements in production code
	4. ✅ No broad `except:` blocks
	5. ✅ All type hints present on public functions
	6. ✅ File paths use `pathlib.Path`
	7. ✅ Logging uses `loguru`

	## Risk Models

	### Implemented Models

	The assistant currently includes the following built-in risk calculators:

	- Gail - Breast cancer risk
	- Claus - Breast cancer risk based on family history
	- Tyrer-Cuzick - Breast cancer risk (IBIS model)
	- BOADICEA - Breast and ovarian cancer risk (via CanRisk API)
	- PLCOm2012 - Lung cancer risk
	- LLPi - Liverpool Lung Project improved model for lung cancer risk (8.7-year prediction)
	- CRC-PRO - Colorectal cancer risk
	- PCPT - Prostate cancer risk
	- Extended PBCG - Prostate cancer risk (extended model)
	- Prostate Mortality - Prostate cancer-specific mortality prediction
	- MRAT - Melanoma risk (5-year prediction)
	- aMAP - Hepatocellular carcinoma (liver cancer) risk
	- QCancer - Multi-site cancer differential

	Additional models should follow the interfaces under `src/sentinel/risk_models`.

	### Risk Model Implementation Guide

	#### Base Architecture

	All risk models must inherit from `RiskModel` in `src/sentinel/risk_models/base.py`:

	```python
	from sentinel.risk_models.base import RiskModel

	class YourRiskModel(RiskModel):
	def __init__(self):
	super().__init__("your_model_name")
	```

	#### Required Methods

	Every risk model must implement these abstract methods:

	```python
	def compute_score(self, user: UserInput) -> str:
	"""Compute the risk score for a given user profile.

	Args:
	user: The user profile containing demographics, medical history, etc.

	Returns:
	str: Risk percentage as a string or an N/A message if inapplicable.

	Raises:
	ValueError: If required inputs are missing or invalid.
	"""

	def cancer_type(self) -> str:
	"""Return the cancer type this model assesses."""
	return "breast" # or "lung", "prostate", etc.

	def description(self) -> str:
	"""Return a detailed description of the model."""

	def interpretation(self) -> str:
	"""Return guidance on how to interpret the results."""

	def references(self) -> list[str]:
	"""Return list of reference citations."""
	```

	#### UserInput Structure

	All risk models must use the centralized `UserInput` structure - this is the single source of truth for all data types and enums. The `UserInput` class follows a hierarchical structure:

	```
	UserInput
	├── demographics: Demographics
	│ ├── age_years: int
	│ ├── sex: Sex (enum)
	│ ├── ethnicity: Ethnicity \| None
	│ └── anthropometrics: Anthropometrics
	│ ├── height_cm: float \| None
	│ └── weight_kg: float \| None
	├── lifestyle: Lifestyle
	│ ├── smoking: SmokingHistory
	│ └── alcohol: AlcoholConsumption
	├── personal_medical_history: PersonalMedicalHistory
	│ ├── chronic_conditions: list[ChronicCondition]
	│ ├── previous_cancers: list[CancerType]
	│ ├── genetic_mutations: list[GeneticMutation]
	│ └── tyrer_cuzick_polygenic_risk_score: float \| None
	├── female_specific: FemaleSpecific \| None
	│ ├── menstrual: MenstrualHistory
	│ ├── parity: ParityHistory
	│ └── breast_health: BreastHealthHistory
	├── symptoms: list[SymptomEntry]
	└── family_history: list[FamilyMemberCancer]
	```

	#### REQUIRED_INPUTS Specification

	Every risk model must define a `REQUIRED_INPUTS` class attribute using Pydantic's `Annotated` types with `Field` constraints:

	```python
	REQUIRED_INPUTS: dict[str, tuple[type, bool]] = {
	"demographics.age_years": (Annotated[int, Field(ge=18, le=100)], True),
	"demographics.sex": (Sex, True),
	"demographics.ethnicity": (Ethnicity \| None, False),
	"family_history": (list, False), # list[FamilyMemberCancer]
	"symptoms": (list, False), # list[SymptomEntry]
	}
	```

	#### Input Validation

	Every `compute_score` method must start with input validation:

	```python
	def compute_score(self, user: UserInput) -> str:
	"""Compute the risk score for a given user profile."""
	# Validate inputs first
	is_valid, errors = self.validate_inputs(user)
	if not is_valid:
	raise ValueError(f"Invalid inputs for {self.name}: {'; '.join(errors)}")

	# Model-specific validation
	if user.demographics.sex != Sex.FEMALE:
	return "N/A: Model is only applicable to female patients."

	# Continue with model-specific logic...
	```

	#### Data Access Patterns

	```python
	# Demographics
	age = user.demographics.age_years
	sex = user.demographics.sex
	ethnicity = user.demographics.ethnicity

	# Female-specific data
	if user.female_specific is not None:
	menarche_age = user.female_specific.menstrual.age_at_menarche
	num_births = user.female_specific.parity.num_live_births

	# Family history
	for member in user.family_history:
	if member.cancer_type == CancerType.BREAST:
	relation = member.relation
	age_at_diagnosis = member.age_at_diagnosis
	```

	#### Enum Usage

	Always use enums from `sentinel.user_input`, never string literals or custom enums:

	```python
	# ✅ Correct - using UserInput enums
	if user.demographics.sex == Sex.FEMALE:
	if member.cancer_type == CancerType.BREAST:
	if member.relation == FamilyRelation.MOTHER:

	# ❌ Incorrect - string literals
	if user.demographics.sex == "female":
	if member.cancer_type == "breast":

	# ❌ Incorrect - custom enums
	if user.demographics.sex == MyCustomSex.FEMALE:
	```

	Important: All risk models must use the same centralized enums from `UserInput`. If a required enum doesn't exist in `UserInput`, you must:
	1. Extend UserInput by adding the new enum to `src/sentinel/user_input.py`
	2. Never create model-specific enums - this prevents divergence between models
	3. Update all models to use the new centralized enum

	This ensures all risk models share the same data structure and prevents fragmentation.

	#### Extending UserInput

	When a risk model needs fields or enums that don't exist in `UserInput`:

	1. Add to UserInput: Extend `src/sentinel/user_input.py` with new fields/enums
	2. Update all models: Ensure all existing models can handle the new fields (use `\| None` for optional fields)
	3. Never create model-specific structures: This prevents divergence and fragmentation
	4. Test thoroughly: Add tests for new fields in `tests/test_user_input.py`

	Example of extending UserInput:
	```python
	# In src/sentinel/user_input.py
	class ChronicCondition(str, Enum):
	# ... existing values
	NEW_CONDITION = "new_condition" # Add new enum value

	class PersonalMedicalHistory(StrictBaseModel):
	# ... existing fields
	new_field: float \| None = Field(None, description="New field description")
	```

	#### Testing Requirements

	Create comprehensive test files with:
	- Ground Truth Validation: Test against known reference values
	- Input Validation: Test that invalid inputs raise `ValueError`
	- Edge Cases: Test boundary conditions and edge cases
	- Inapplicable Cases: Test cases where model should return "N/A"

	Example test structure:

	```python
	import pytest
	from sentinel.user_input import UserInput, Demographics, Sex
	from sentinel.risk_models import YourRiskModel

	GROUND_TRUTH_CASES = [
	{
	"name": "test_case_name",
	"input": UserInput(
	demographics=Demographics(
	age_years=40,
	sex=Sex.FEMALE,
	# ... other fields
	),
	# ... rest of input
	),
	"expected": 1.5, # Expected risk percentage
	},
	# ... more test cases
	]

	class TestYourRiskModel:
	@pytest.mark.parametrize("case", GROUND_TRUTH_CASES, ids=lambda x: x["name"])
	def test_ground_truth_validation(self, case):
	"""Test against ground truth results."""
	user_input = case["input"]
	expected_risk = case["expected"]

	actual_risk_str = self.model.compute_score(user_input)
	actual_risk = float(actual_risk_str)
	assert actual_risk == pytest.approx(expected_risk, abs=0.01)
	```

	#### Migration Checklist

	When adapting an existing risk model to the new structure:

	- [ ] Update imports to use new `user_input` module
	- [ ] Add `REQUIRED_INPUTS` with Pydantic validation
	- [ ] Refactor `compute_score` to use new `UserInput` structure
	- [ ] Replace string literals with enums
	- [ ] Update parameter extraction logic
	- [ ] Add input validation at start of `compute_score`
	- [ ] Update all test cases to use new `UserInput` structure
	- [ ] Run full test suite to ensure 100% pass rate
	- [ ] Run pre-commit hooks to ensure code quality

	## LLM and Code Assistant Guidelines

	When generating or modifying code, AI assistants MUST:

	### Mandatory Rules
	- Follow ALL guidelines in this document without exception
	- Never use forbidden constructs (`os.path`, `Optional[]`, `List[]`, `print()`, broad `except:`)
	- Never add decorative comment banners or unnecessary formatting
	- Always generate clean, modular, statically typed code

	### Code Generation Standards
	- Prefer clarity and simplicity over cleverness
	- Use modern Python type hints exclusively
	- Include comprehensive docstrings for non-trivial functions
	- Ensure all examples compile, type-check, and pass linting

	### Verification
	All generated code must:
	- Pass `ruff format` and `ruff check`
	- Include proper type hints
	- Use `pathlib.Path` for all file operations
	- Use `loguru` for logging
	- Follow the Variable Naming guidelines

	## Important Note for Developers

	When making changes to the project, ensure that the following files are updated to reflect the changes:

	- `README.md`
	- `AGENTS.md`
	- `GEMINI.md`

	For additional implementation details, refer to the existing risk model implementations in `src/sentinel/risk_models/`.