Files
ArchiveBox/archivebox/core
Claude d36079829b feat: replace index.json with index.jsonl flat JSONL format
Switch from hierarchical index.json to flat index.jsonl format for
snapshot metadata storage. Each line is a self-contained JSON record
with a 'type' field (Snapshot, ArchiveResult, Binary, Process).

Changes:
- Add JSONL_INDEX_FILENAME constant to constants.py
- Add TYPE_PROCESS and TYPE_MACHINE to jsonl.py type constants
- Add binary_to_jsonl(), process_to_jsonl(), machine_to_jsonl() converters
- Add Snapshot.write_index_jsonl() to write new format
- Add Snapshot.read_index_jsonl() to read new format
- Add Snapshot.convert_index_json_to_jsonl() for migration
- Update Snapshot.reconcile_with_index() to handle both formats
- Update fs_migrate to convert during filesystem migration
- Update load_from_directory/create_from_directory for both formats
- Update legacy.py parse_json_links_details for JSONL support

The new format is easier to parse, extend, and mix record types.
2025-12-30 18:21:06 +00:00
..
2025-12-30 09:57:33 -08:00
wip
2025-12-28 17:51:54 -08:00
2024-11-18 04:27:37 -08:00
wip
2025-12-28 17:51:54 -08:00
wip
2025-12-28 17:51:54 -08:00
wip
2025-12-28 17:51:54 -08:00
2025-12-29 04:02:11 -08:00
wip
2025-12-28 17:51:54 -08:00
wip
2025-12-28 17:51:54 -08:00
wip
2025-12-28 17:51:54 -08:00
2025-12-29 04:02:11 -08:00
2025-12-29 21:27:31 -08:00