Document Versioning & RAG Processing
Pidima's advanced document management system provides version control, intelligent processing, and AI-powered document understanding through RAG (Retrieval-Augmented Generation) technology. This ensures your AI features always work with the most relevant and up-to-date documentation.
Overview
When you attach documents to your project, Pidima:
- Stores Multiple Versions: Maintains complete version history
- Processes for AI: Extracts and indexes content using RAG
- Monitors Status: Tracks document processing and availability
- Enables Context-Aware AI: Powers chatbot and Requirements Assist with document knowledge
Document Versions
Version Management
Each document attachment supports multiple versions:
- Version Numbers: Automatic versioning (v1.0, v1.1, v2.0, etc.)
- Version Status: Active, Obsolete, or Processing
- Upload Tracking: Timestamp and uploader for each version
- Size Monitoring: File size for each version

Version Indicators
The document list displays key version information:
- Badge Count: Shows "2 versions" or number of versions available
- Active Version: Highlighted version currently in use (green "Active" badge)
- Obsolete Versions: Previous versions marked as "Obsolete" (gray badge)
- Latest Version: The most recently uploaded version
Uploading New Versions
To add a new version of an existing document:
- Click "Attach New Document"
- Upload a file with the same name as an existing document
- System automatically creates a new version
- Previous version is marked as obsolete by default
- New version enters processing queue
Alternatively, use the Edit action next to a document to:
- Replace with a new version
- Update description
- Modify metadata
RAG Processing
What is RAG?
RAG (Retrieval-Augmented Generation) is an advanced AI technique that powers Pidima's intelligent document understanding:
- Extracts Content: Parses text from PDFs, Word docs, and other formats
- Chunks Information: Breaks documents into semantic chunks for efficient processing
- Creates Embeddings: Generates AI-understandable vector representations
- Enables Retrieval: Allows AI to find relevant sections quickly and accurately
- Enhances Responses: Grounds AI answers in your actual documents, not generic knowledge
- Version-Aware: Tracks which document version was used for each AI interaction
How RAG Powers Pidima's AI Features
RAG processing enables several intelligent features:
1. AI Chatbot Enhancement
- Answers questions by referencing your specific documents
- Cites exact page numbers and sections
- Compares multiple documents simultaneously
- Provides document-grounded responses, not generic AI knowledge
2. Requirements Assist
- Uses document content for intelligent autofill suggestions
- References specifications when rewriting requirements
- Derives child requirements from parent specifications
- Validates requirements against documented standards
3. Test Generation
- Creates test cases based on specification details
- Includes acceptance criteria from uploaded documents
- References test procedures in your documentation
- Ensures coverage of all documented requirements
4. Compliance Checking
- Compares requirements against regulatory standards
- Identifies gaps in compliance documentation
- Suggests additions based on standard requirements
- Provides traceability to specific regulation clauses
Processing Status
Each document version shows processing status:
Processed (Green)
- ✓ Document fully processed and indexed
- ✓ Available for AI features
- ✓ Content searchable by chatbot
- ✓ Ready for Requirements Assist operations
Processing (Yellow/Orange)
- ⏳ Document being processed
- ⏳ AI features may have limited access
- ⏳ Typically completes within 1-5 minutes
- ⏳ Depends on document size and complexity
Failed (Red)
- ✗ Processing encountered an error
- ✗ Document may be corrupted or unsupported
- ✗ Not available for AI features
- ✗ Requires re-upload or format conversion
Processing Time
Processing duration varies by:
-
Document Size: Larger files take longer
- Small (< 1 MB): 1-2 minutes
- Medium (1-10 MB): 2-5 minutes
- Large (> 10 MB): 5-15 minutes
-
Document Type: Some formats require more processing
- Text/PDF: Fastest
- Scanned PDFs: Requires OCR, slower
- Complex layouts: Additional processing time
-
System Load: Peak usage may increase processing time
Applied Levels and Modules
Configuring Document Scope
Control which parts of your project use each document:
Applied Levels Specify which requirement levels should reference this document:
- Customer Requirements
- System Requirements
- Subsystem Requirements
- Component Requirements
Applied Modules Select functional modules that use this document:
- Software
- Hardware
- Mechanical
- Electrical
- Safety
- Security
Benefits of Scoping
Properly configured scope ensures:
- Relevant AI Responses: Chatbot references appropriate documents
- Faster Processing: AI searches smaller, relevant document sets
- Better Context: Requirements Assist uses domain-specific documents
- Cleaner Results: Reduces irrelevant information in AI outputs
Configuration Example
For a document "EV_Battery_Requirements.pdf":
Applied Levels:
✓ System Requirements
✓ Subsystem Requirements
Applied Modules:
✓ Electrical
✓ Safety
⚬ Software (unchecked)
⚬ Mechanical (unchecked)
Result: This document powers AI features only for electrical and safety requirements at system and subsystem levels.
Monitoring RAG Status
Real-Time Monitoring
The document table includes a Monitoring column showing comprehensive processing status:
- Enabled: RAG processing active and updated regularly
- Status: Current processing state (Processed, Processing, Failed)
- Last Update: When the document was last processed
- Refresh Rate: How often the system checks for updates
- Processing Time: How long the last processing took
- Chunk Count: Number of document chunks created
Example:
Monitoring: ✓ Enabled
Every 1m
Last: 568h ago
Status: ✓ Processed
Chunks: 247
This indicates:
- Monitoring is active and working properly
- System checks every 1 minute for changes
- Document was processed 568 hours ago (about 24 days)
- Successfully processed into 247 semantic chunks
- Ready for AI feature use
Understanding Processing Status Colors
Green Badge: "Processed" ✓
- Document fully indexed and available
- All 247 chunks created successfully
- AI features can access this document
- Chatbot will cite this document
- Requirements Assist can use content
- Ready for production use
Yellow/Orange Badge: "Processing" ⏳
- Document currently being analyzed
- Chunks being created in background
- Partial availability for AI features
- Usually completes within 1-5 minutes
- No action required - wait for completion
Red Badge: "Failed" ✗
- Processing encountered an error
- Document not available for AI features
- May be corrupted or unsupported format
- Action required: Check format, re-upload, or convert
- Contact support if issue persists
Processing Performance
Small Documents (< 1 MB):
- Processing time: 1-2 minutes
- Chunk count: 50-100 chunks
- Memory usage: Low
- Ideal for: Quick reference docs, standards excerpts
Medium Documents (1-10 MB):
- Processing time: 2-5 minutes
- Chunk count: 100-500 chunks
- Memory usage: Moderate
- Ideal for: Full specifications, detailed requirements
Large Documents (> 10 MB):
- Processing time: 5-15 minutes
- Chunk count: 500-2000+ chunks
- Memory usage: High
- Consider: Breaking into smaller logical sections for better performance
Edit Monitoring Settings
Click the Edit button next to a document version to:
- Enable/Disable Monitoring: Turn RAG processing on or off
- Adjust Frequency: Change how often content is re-indexed
- View Processing Logs: See detailed processing history
- Trigger Re-processing: Force document re-processing if needed
When to Use Monitoring
Enable Monitoring For:
- Documents that change frequently
- Living specifications updated regularly
- Documents critical to AI accuracy
- Recently uploaded documents in processing
Disable Monitoring For:
- Finalized, unchanging documents
- Obsolete versions no longer in use
- Documents that successfully processed once
- Large files to reduce processing load
Version Status Management
Active vs. Obsolete
Active Status (Green Badge)
- Current version in use
- Referenced by AI features
- Included in searches
- Used for new requirements
Obsolete Status (Gray Badge)
- Previous version, archived
- Not used by AI features
- Maintained for audit trail
- Can be restored if needed
Managing Status
To change version status:
- Locate the document version in the table
- Click the status badge or Edit button
- Select new status:
- Set as Active: Make this version current
- Mark as Obsolete: Archive this version
- Confirm the change
Note: Only one version of a document can be "Active" at a time. Setting a new version as Active automatically marks the previous active version as Obsolete.
AI Integration
How RAG Enhances AI Features
AI Chatbot
- Answers questions using document content
- Cites specific sections and page numbers
- Provides context from specifications
- Compares requirements to standards
Requirements Assist
- Autofill: Uses documents to determine appropriate values
- Rewrite: References specifications for enhanced clarity
- Level Generation: Derives requirements from source documents
- Validation: Checks requirements against documented standards
Test Generation
- Creates test cases based on specification details
- Includes acceptance criteria from documents
- References test procedures in uploaded docs
- Ensures coverage of documented requirements
Viewing Document Context
See which documents inform AI responses:
- Use the AI Chatbot to ask about a requirement
- The AI may cite: "According to EV_Battery_Requirements.pdf (v1.1)..."
- Click cited documents to view in context
- See exactly which sections influenced the AI response
Supported Document Formats
Fully Supported
- PDF (.pdf): Including scanned documents with OCR
- Microsoft Word (.doc, .docx)
- Plain Text (.txt)
- Markdown (.md)
- Rich Text (.rtf)
Partially Supported
- Microsoft Excel (.xls, .xlsx): Extracts text from cells
- PowerPoint (.ppt, .pptx): Extracts slide text
- HTML (.html, .htm): Parses readable content
Not Supported
- Image files (.jpg, .png) without OCR
- CAD files (.dwg, .dxf)
- Proprietary formats without text extraction
- Encrypted or password-protected documents
Real-World RAG Use Cases
Use Case 1: Multi-Version Specification Management
Scenario: Automotive manufacturer with evolving battery specifications
Setup:
EV_Battery_Car_Requirements.pdf
├── Version 1.0 (Obsolete) - 12V specifications
│ Status: Processed ✓
│ Chunks: 156
│ Applied Levels: System, Subsystem
│
└── Version 1.1 (Active) - 24V specifications
Status: Processed ✓
Chunks: 189
Applied Levels: System, Subsystem, Component
Last processed: 2 hours ago
Workflow:
- Version 1.0 uploaded → RAG processes → 156 chunks created
- AI Chatbot uses v1.0 for all queries about battery requirements
- Specifications updated → Version 1.1 uploaded
- RAG processes v1.1 → 189 chunks (more detailed)
- v1.1 set as "Active" → AI now references new version
- v1.0 marked "Obsolete" → Kept for historical reference
Result:
- AI always uses latest approved specifications
- Historical versions maintained for audit trail
- Requirement changes tracked with document version references
- Seamless transition with zero AI confusion
Use Case 2: Regulatory Compliance with ISO Standards
Scenario: Medical device company ensuring ISO 13485 compliance
Setup:
ISO_13485_Quality_Management.pdf
├── Status: Processed ✓
├── Chunks: 847 (large standard document)
├── Processing time: 12 minutes
├── Applied Levels: All levels
├── Applied Modules: Quality, Safety, Documentation
└── Monitoring: Enabled (checks annually for standard updates)
AI Integration:
User in Chatbot: "Does requirement MED-REQ-00015 meet ISO 13485 section 7.3?"
AI Response:
"Analyzing requirement MED-REQ-00015 against ISO 13485:2016 section 7.3
(Design and Development):
✓ Section 7.3.2 (Design Inputs): COMPLIANT
Your requirement specifies user needs and regulatory requirements
⚠ Section 7.3.3 (Design Outputs): PARTIAL
Missing: Design output documentation format requirements
Recommendation: Add clause about design output documentation per 7.3.3.b
✗ Section 7.3.4 (Design Review): NON-COMPLIANT
Not found: Design review checkpoints and criteria
Required: Add design review stages per 7.3.4.a-e
Source: ISO_13485_Quality_Management.pdf, pages 23-26"
Benefits:
- Real-time compliance checking during requirement authoring
- Specific citations to standard clauses
- Gap identification before audits
- Reduced compliance risk
Use Case 3: Cross-Document Analysis
Scenario: Aerospace project with multiple related specifications
Setup:
Project Documents (All Processed):
1. DO-178C_Software_Standard.pdf (Active)
Chunks: 1,247 | Applied: Software modules
2. ARP4754A_System_Development.pdf (Active)
Chunks: 892 | Applied: System level
3. Internal_Coding_Standards_v3.pdf (Active)
Chunks: 234 | Applied: Software component level
4. Customer_Requirements_Spec.pdf (Active)
Chunks: 456 | Applied: Customer requirements level
AI Chatbot Query:
User: "Does our software requirement SW-REQ-0042 satisfy both DO-178C
and our customer requirements?"
AI Response:
"Cross-referencing SW-REQ-0042 across multiple documents:
DO-178C Compliance (DO-178C_Software_Standard.pdf):
✓ Section 5.1.1: High-level requirements - COMPLIANT
✓ Section 5.1.3: Derived requirements - COMPLIANT
⚠ Section 6.3.1: Missing test coverage criteria
Customer Requirements (Customer_Requirements_Spec.pdf):
✓ Requirement CUS-001 (page 15): Fully addresses
✓ Requirement CUS-003 (page 18): Partially addresses
✗ Requirement CUS-007 (page 22): Not addressed
Internal Standards (Internal_Coding_Standards_v3.pdf):
✓ Naming conventions: Compliant
✓ Documentation format: Compliant
Recommendation: Add test coverage criteria per DO-178C 6.3.1 and
link to CUS-007 for complete coverage."
Benefits:
- Single query analyzes multiple standards simultaneously
- Identifies gaps across all relevant documents
- Provides specific page references
- Saves hours of manual cross-referencing
Use Case 4: Document Version Comparison
Scenario: Tracking changes between specification versions
Setup:
Battery_Specification.pdf
├── Version 2.0 (Obsolete)
│ Chunks: 178
│ Key content: 12V battery requirements
│
└── Version 3.0 (Active)
Chunks: 203
Key content: 24V battery requirements (updated)
AI Analysis:
User: "What changed between version 2.0 and 3.0 of the battery spec?"
AI Response:
"Comparing Battery_Specification.pdf versions:
Version 2.0 (Obsolete):
- Nominal voltage: 12V (section 3.1, page 8)
- Operating range: 11.5V - 12.5V
- Safety standard: Basic (< 20V category)
Version 3.0 (Active):
- Nominal voltage: 24V (section 3.1, page 8)
- Operating range: 23V - 25V
- Safety standard: Enhanced (> 20V category, IEC 60950-1)
- NEW: Arc flash protection requirements (section 5.3, page 15)
- NEW: Insulation rating requirements (section 5.4, page 16)
Impact on Requirements:
- 15 existing requirements reference 12V → Need updates
- 3 new safety requirements needed for 24V compliance
- Test coverage gaps: 8 additional tests recommended"
Benefits:
- Automated change detection between versions
- Impact analysis on existing requirements
- Identifies new requirements needed
- Streamlines version transition process
Use Case 5: Intelligent Test Generation from Specs
Scenario: Generating test cases directly from specification documents
Setup:
EV_Battery_Test_Procedures.pdf (Active)
├── Status: Processed ✓
├── Chunks: 312
├── Applied Levels: System, Subsystem
└── Content: Detailed test procedures for battery validation
Workflow:
- User selects requirement: "Battery shall meet voltage specifications"
- Clicks: "Generate Tests with AI"
- AI analyzes:
- Requirement text
- Linked specification: EV_Battery_Test_Procedures.pdf
- Industry standards (if uploaded)
AI-Generated Tests:
Test Case TC-BAT-001: Nominal Voltage Measurement
Source: EV_Battery_Test_Procedures.pdf, Section 4.2.1, page 18
Description: "Measure battery voltage at rest (no load) after
2-hour stabilization period per section 4.2.1"
Steps (from specification):
1. Disconnect battery from all loads
2. Allow 2-hour stabilization at 20°C ± 2°C
3. Connect calibrated multimeter (±0.1V accuracy)
4. Record voltage reading
5. Verify: 24.0V ± 0.6V (23.4V - 24.6V)
Expected Result: Voltage within specified range
Link: SYSTEM-REQ-00003 (Battery Verification)
Benefits:
- Tests generated from authoritative source documents
- Includes specific procedures from specifications
- Automatically linked to requirements
- Maintains compliance with documented test methods
Best Practices
Document Naming
Use clear, consistent naming:
- Good:
EV_Battery_Requirements_v2.1.pdf - Good:
ISO_26262_Compliance_Standards.pdf - Avoid:
doc1.pdf,final_FINAL_v2_FINAL.pdf
Version Control
Maintain clean version history:
- Upload new versions only when content changes significantly
- Mark obsolete versions promptly
- Keep only necessary versions (archive or delete old versions)
- Document changes in the description field
Scope Configuration
Set accurate Applied Levels and Modules:
- Be specific about where documents apply
- Don't over-apply documents to all levels/modules
- Update scopes when project structure changes
- Review scopes quarterly
Monitoring Strategy
Balance accuracy with performance:
- Enable monitoring for active documents
- Disable monitoring for stable, finalized documents
- Use longer intervals (5m+) for large documents
- Re-process after document updates
Document Quality
Ensure AI can effectively process documents:
- Use text-based PDFs, not just scanned images
- Ensure good scan quality if using OCR
- Structure documents with clear headings
- Break very large documents into logical sections
- Avoid excessive formatting that obscures text
RAG Performance Optimization
Maximizing AI Accuracy
Tip 1: Document Quality Matters
- Use text-based PDFs (not scanned images when possible)
- Ensure good scan quality if using OCR (300+ DPI)
- Avoid heavily formatted documents with complex layouts
- Use clear section headings and structure
Tip 2: Optimal Document Size
- Split very large documents (>50 MB) into logical sections
- Example: Instead of "Complete_ISO_Standards.pdf" (100 MB)
- Use: "ISO_13485_Section_7_Design.pdf" (15 MB)
- Use: "ISO_13485_Section_8_Production.pdf" (12 MB)
- Benefits: Faster processing, more accurate retrieval, easier maintenance
Tip 3: Applied Levels Configuration
- Be specific about where documents apply
- Don't apply all documents to all levels
- Example for battery specification:
EV_Battery_Requirements.pdf✓ Applied to: System Requirements, Subsystem Requirements✗ Not applied to: Software Component RequirementsResult: AI won't confuse software developers with battery specs
Tip 4: Active vs Obsolete Management
- Keep only ONE version as "Active" per document name
- Mark superseded versions as "Obsolete" immediately
- Prevents AI from mixing information from different versions
- Example:
Battery_Spec_v1.pdf → Obsolete (12V era)Battery_Spec_v2.pdf → Active (24V current)
Tip 5: Monitoring Strategy
- Enable monitoring for living documents that change frequently
- Disable monitoring for finalized, stable specifications
- Adjust frequency based on update cadence:
- Active development: Every 1 minute
- Weekly updates: Every 1 hour
- Stable docs: Every 24 hours or disable
Troubleshooting RAG Issues
Issue: AI Not Citing My Document
Checklist:
- ✓ Document status is "Processed" (green badge)
- ✓ Document version is "Active" (not Obsolete)
- ✓ Applied Levels include the requirement level you're working on
- ✓ Applied Modules include the functional area
- ✓ Monitoring is "Enabled"
- ✓ Ask chatbot specifically: "What does [DocumentName] say about X?"
Issue: AI Gives Inconsistent Answers
Possible causes:
- Multiple document versions marked as "Active"
- Conflicting information in different documents
- Document not fully processed (yellow status)
Solution:
- Check document versions - mark only one as Active
- Wait for processing to complete (green status)
- Review Applied Levels to ensure appropriate scope
- Regenerate RAG index if needed (Edit → Re-process)
Issue: Processing Stuck on "Processing" Status
If document shows processing for > 30 minutes:
- Check document size (may timeout if > 100 MB)
- Verify file format is supported
- Try re-uploading the document
- Split large documents into smaller files
- Contact support with document details
Troubleshooting
Document Not Processing
If a document shows "Processing" for extended periods:
- Check document size (> 50 MB may time out)
- Verify file format is supported
- Try re-uploading the document
- Break large documents into smaller parts
- Contact support if issue persists
AI Not Referencing Document
If AI features don't use your document:
- Verify status is "Active" and "Processed"
- Check Applied Levels and Modules match your needs
- Ensure monitoring is enabled
- Wait for processing to complete (check status)
- Try asking the chatbot specifically about the document
Slow Processing
To improve processing speed:
- Reduce monitoring frequency (e.g., every 5m instead of 1m)
- Disable monitoring for documents that don't change
- Upload documents during off-peak hours
- Optimize document size before uploading
- Use text-based PDFs instead of scanned images
Version Confusion
If unsure which version is current:
- Look for the green "Active" badge
- Check the "Uploaded" date for latest version
- Review the version badge (e.g., "v1.1")
- Read document descriptions for clarification
Advanced Features
Bulk Version Management
Manage multiple document versions efficiently:
- Select multiple documents using checkboxes
- Use bulk actions to:
- Mark as Active/Obsolete
- Enable/Disable monitoring
- Delete unused versions
- Update Applied Levels/Modules
Document Analytics
Track document usage and effectiveness:
- View Processing History: See all processing events
- AI Reference Count: How often AI cites this document
- Coverage Metrics: Which requirements link to this document
- Version Comparison: Compare content between versions
API Integration
Automate document management:
- Upload documents via API
- Trigger RAG processing programmatically
- Query document status
- Retrieve version history
- Update metadata and settings
See Integration & API for details.