Skip to content

Files

Files are the raw data sources that can be uploaded to Catalyzed and processed into dataset tables.

Catalyzed supports common data file formats including:

  • CSV - Comma-separated values
  • JSON - JSON and JSON Lines (NDJSON)
  • Parquet - Apache Parquet columnar format
  • Excel - Microsoft Excel spreadsheets (.xlsx, .xls)

Files can be uploaded via the UI or API:

Terminal window
curl -X POST https://api.catalyzed.ai/files \
-H "Authorization: Bearer YOUR_API_TOKEN" \
-F "file=@data.csv" \
-F "teamId=YOUR_TEAM_ID"

After upload, files go through a processing pipeline that:

  1. Validates the file format and structure
  2. Infers or applies the schema
  3. Converts data to an optimized internal format
  4. Registers the data as a queryable table

Files transition through several states:

  • pending - Upload initiated
  • processing - Being processed and indexed
  • ready - Available for querying
  • failed - Processing failed (check error details)

See the Files API for complete endpoint documentation.