aktualizacja dokumentacji w związku z TRASH i nowymi statusami plików.

This commit is contained in:
Grzegorz Michalski
2026-02-10 08:21:51 +01:00
parent 70909ba8c4
commit cdd9dff32d
6 changed files with 104 additions and 22 deletions

View File

@@ -371,11 +371,14 @@ INBOX Bucket - Pattern: 'INBOX/{SOURCE}/{SOURCE_FILE_ID}/{TABLE_NAME}/'
└── {pTableId}/ -- e.g., "A_UC_DISSEM_METADATA_LOADS", "STANDING_FACILITIES"
└── files matching {pSourceFileNamePattern}
ODS Bucket - Pattern: 'ODS/{SOURCE}/{TABLE_NAME}/'
── ODS/
DATA Bucket - Patterns: 'ODS/{SOURCE}/{TABLE_NAME}/' and 'TRASH/{SOURCE}/{TABLE_NAME}/'
── ODS/
│ └── {pSourceKey}/ -- e.g., "C2D", "LM"
│ └── {pTableId}/ -- e.g., "A_UC_DISSEM_METADATA_LOADS", "STANDING_FACILITIES"
│ └── processed files
└── TRASH/ -- File retention subfolder (not a separate bucket)
└── {pSourceKey}/ -- e.g., "C2D", "LM"
└── {pTableId}/ -- e.g., "A_UC_DISSEM_METADATA_LOADS", "STANDING_FACILITIES"
└── processed files
└── {pTableId}/ -- CSV files after archival (ARCHIVED_AND_TRASHED status)
ARCHIVE Bucket - Pattern: 'ARCHIVE/{SOURCE}/{TABLE_NAME}/'
└── ARCHIVE/
@@ -389,9 +392,11 @@ ARCHIVE Bucket - Pattern: 'ARCHIVE/{SOURCE}/{TABLE_NAME}/'
**Critical Path Pattern Requirements:**
- **INBOX** requires full 3-level path: `INBOX/{SOURCE}/{SOURCE_FILE_ID}/{TABLE_NAME}/`
- **ODS** uses simplified 2-level path: `ODS/{SOURCE}/{TABLE_NAME}/` (no SOURCE_FILE_ID)
- **TRASH** uses simplified 2-level path: `TRASH/{SOURCE}/{TABLE_NAME}/` (subfolder in DATA bucket)
- **ARCHIVE** uses simplified 2-level path: `ARCHIVE/{SOURCE}/{TABLE_NAME}/` (no SOURCE_FILE_ID)
- **All patterns are mandatory** - no simplified versions allowed
- File names must match `pSourceFileNamePattern` for automatic processing
- **Note**: TRASH is NOT a separate bucket - it's a subfolder within the DATA bucket
## Configuration Management Best Practices
@@ -693,7 +698,10 @@ SELECT FILE_MANAGER.PROCESS_SOURCE_FILE(
1. **File Arrival**: File is uploaded to Oracle Cloud Storage bucket
2. **Registration**: FILE_MANAGER.REGISTER_SOURCE_FILE_RECEIVED() creates record
3. **Status**: RECEIVED → VALIDATED → READY_FOR_INGESTION → INGESTED → ARCHIVED
3. **Status**: RECEIVED → VALIDATED → READY_FOR_INGESTION → INGESTED → ARCHIVED_AND_TRASHED → ARCHIVED_AND_PURGED (optional)
- Legacy ARCHIVED status maintained for backward compatibility
- ARCHIVED_AND_TRASHED: Files archived to Parquet and kept in TRASH folder (default)
- ARCHIVED_AND_PURGED: Files archived to Parquet and deleted from TRASH folder
4. **External Table**: Created automatically based on template table
5. **Data Loading**: Data is loaded into target ODS schema
6. **Archival**: File is moved to archive bucket after processing