Data Import and Export

Emma provides powerful tools for importing data into Infrahub from CSV files and exporting Infrahub data back to CSV format. This enables efficient migration from existing systems and data sharing between tools.

Data Exporter

Data import

The Data Importer allows you to upload CSV files and map their columns to Infrahub schema attributes, enabling you to bulk-load infrastructure data.

Supported file formats

CSV files with headers
UTF-8 encoding (recommended)
Common delimiters: comma, semicolon, tab
File size: Up to 100MB per upload

Import process

Step 1: Prepare your data

Ensure your CSV file:

Has column headers that roughly match your schema attributes
Contains clean data with consistent formatting
Includes required fields as defined by your schema
Uses consistent values for relationships and enums

Example CSV structure:

name,model,serial_number,location,ip_address,status
switch-01,Cisco 3850,FCW2140L0EF,datacenter-1,192.168.1.10,active
switch-02,Cisco 3850,FCW2140L0EG,datacenter-1,192.168.1.11,active

Step 2: Upload and map

Select your CSV file using the file upload interface
Choose the target schema from your Infrahub instance
Map CSV columns to schema attributes using the dropdown selectors
Configure relationship handling for foreign key references
Set data validation options like duplicate handling

Step 3: Preview and validate

Emma provides a preview showing:

Mapped data - How your CSV data will appear in Infrahub
Validation results - Any data quality issues or constraint violations
Relationship resolution - How foreign keys will be resolved
Import statistics - Number of records to be created/updated

Step 4: execute import

After reviewing the preview:

Confirm the import if everything looks correct
Monitor progress through the real-time status updates
Review results including success/failure counts and any error details

Advanced import features

Relationship Handling
Data Validation
Performance Options

Emma intelligently handles relationships between objects:

By Name/ID Lookup:

Reference existing objects by name or ID
Automatic resolution of relationship targets
Error reporting for unresolved references

Nested Object Creation:

Create related objects inline during import
Support for hierarchical data structures
Maintains referential integrity

Example:

device_name,interface_name,ip_address,vlan_name
switch-01,GigabitEthernet1/0/1,192.168.1.10,production
switch-01,GigabitEthernet1/0/2,192.168.1.11,management

Data export

The Data Exporter extracts data from Infrahub and formats it as CSV files for use in other tools or for backup purposes.

Export options

Schema-based export

Export all objects of a specific schema type:

Select schema from your Infrahub instance
Choose attributes to include in the export
Configure relationship handling (include related object details)
Set filtering criteria to limit exported records

Query-based export

Export data using custom filters:

Field filters - Filter by specific attribute values
Date ranges - Export data from specific time periods
Relationship filters - Filter based on related object properties
Complex queries - Combine multiple filter criteria

Bulk export

Export multiple schema types in a single operation:

Related schemas - Export parent and child objects together
Dependency order - Automatic ordering for reimport compatibility
Consistent snapshots - Ensure data consistency across exports

Export formats

Emma supports multiple CSV format options:

Standard CSV:

Comma-separated values
Header row with attribute names
UTF-8 encoding

Customizable Format:

Custom delimiters
Quote character options
Line ending preferences
Character encoding selection

Relationship Handling:

Flatten relationships into columns
Include relationship IDs
Export related object details

Advanced export features

Incremental export

Export only changes since last export:

Timestamp-based - Export records modified after a specific date
Change tracking - Export based on Infrahub's change tracking
Delta exports - Include only modified fields

Scheduled exports

Configure automated exports:

Recurring schedules - Daily, weekly, monthly exports
Event-triggered - Export on data changes
Multiple formats - Generate different exports for different consumers

Data quality and validation

Import validation

Emma validates imported data against:

Schema definitions - Ensure data matches expected structure
Data types - Validate numeric, date, and other typed fields
Constraints - Check uniqueness, required fields, and custom rules
Relationships - Verify referenced objects exist

Export verification

Exported data includes:

Metadata - Export timestamp, schema versions, record counts
Integrity checks - Checksums for data verification
Audit trail - Information about data source and transformation

Best practices

Import best practices

Validate data quality before import using external tools
Start with small batches to test mappings and validations
Use consistent naming for relationship references
Clean up data to remove duplicates and inconsistencies
Backup Infrahub before large imports

Export best practices

Document export purposes to choose appropriate options
Test import compatibility if exporting for reimport
Use incremental exports for regular synchronization
Include metadata for audit and tracking purposes
Validate exported data before using in downstream systems

Performance optimization

Use appropriate batch sizes (typically 100-1000 records)
Limit concurrent operations to avoid overwhelming Infrahub
Monitor system resources during large operations
Use filtering to export only necessary data
Schedule large operations during low-usage periods

Integration scenarios

Common use cases

Migration from Legacy Systems:

Export data from existing tools to CSV
Transform and clean data as needed
Import into Infrahub schemas

Data Synchronization:

Regular exports to external systems
Incremental updates based on changes
Bidirectional synchronization workflows

Backup and Recovery:

Full data exports for backup purposes
Schema-consistent exports for disaster recovery
Point-in-time data snapshots

Reporting and Analytics:

Export data for business intelligence tools
Create regular data extracts for reporting
Integration with data lakes and warehouses

Integration with other Emma features

Schema Library - Use library schemas as import targets
Schema Builder - Create schemas for imported data
Schema Visualizer - Understand data relationships before import/export

Troubleshooting

Common import issues

File Upload Problems:

Check file size limits (100MB maximum)
Verify file format and encoding
Ensure proper CSV structure with headers

Mapping Errors:

Verify schema attribute names
Check data type compatibility
Review required field mappings

Validation Failures:

Check data quality and formatting
Verify relationship references exist
Review constraint violations

Performance Issues:

Reduce batch sizes
Check network connectivity
Monitor Infrahub resource usage

Common export issues

Missing Data:

Verify user permissions for accessed schemas
Check filtering criteria
Review relationship configurations

Format Problems:

Verify encoding settings
Check delimiter configuration
Review quote character handling

Performance Issues:

Use filtering to reduce data volume
Increase timeout settings
Monitor export progress

For detailed troubleshooting, see the Troubleshooting Guide.

Data import​

Supported file formats​

Import process​

Step 1: Prepare your data​

Step 2: Upload and map​

Step 3: Preview and validate​

Step 4: execute import​

Advanced import features​

Data export​

Export options​

Schema-based export​

Query-based export​

Bulk export​

Export formats​

Advanced export features​

Incremental export​

Scheduled exports​

Data quality and validation​

Import validation​

Export verification​

Best practices​

Import best practices​

Export best practices​

Performance optimization​

Integration scenarios​

Common use cases​

Integration with other Emma features​

Troubleshooting​

Common import issues​

Common export issues​