ArchiveBox/archivebox/cli at cf387ed59f46ff45157e8c0c96cff4fbd15f5ea7 - ArchiveBox - gitea

alex/ArchiveBox

mirror of https://github.com/ArchiveBox/ArchiveBox.git synced 2026-04-06 07:47:53 +10:00

Claude cf387ed59f refactor: batch all URLs into single Crawl, update tests

- archivebox crawl now creates one Crawl with all URLs as newline-separated string
- Updated tests to reflect new pipeline: crawl -> snapshot -> extract
- Added tests for Crawl JSONL parsing and output
- Tests verify Crawl.from_jsonl() handles multiple URLs correctly
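
The batching described in this commit message can be illustrated with a small standalone sketch. It is not ArchiveBox's code: the function names and the record fields ("type", "urls") are hypothetical, chosen only to show how several URLs might be carried as one newline-separated string inside a single JSONL record and split apart again on parsing.

```python
import json


def urls_to_crawl_record(urls):
    """Batch URLs into one record (hypothetical shape, not ArchiveBox's schema).

    All URLs are stored in a single newline-separated string.
    """
    cleaned = [u.strip() for u in urls if u.strip()]
    return {"type": "Crawl", "urls": "\n".join(cleaned)}


def crawl_record_to_jsonl(record):
    """Serialize the record as one JSONL line.

    json.dumps escapes the embedded newlines, so the output stays on one line.
    """
    return json.dumps(record)


def parse_crawl_jsonl(line):
    """Parse one JSONL line and return the individual URLs."""
    record = json.loads(line)
    return [u for u in record.get("urls", "").split("\n") if u]


urls = ["https://example.com", "https://example.org/page"]
line = crawl_record_to_jsonl(urls_to_crawl_record(urls))
parsed = parse_crawl_jsonl(line)
```

Because the newlines are escaped during serialization, the record occupies exactly one line of output, which is what lets a downstream stage read it line by line and still recover every URL.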

2025-12-30 20:06:56 +00:00

__init__.py

remove Seed model in favor of Crawl as template

2025-12-25 01:52:41 -08:00

archivebox_add.py

fix initial migrations

2025-12-29 21:27:31 -08:00

archivebox_config.py

wip

2025-12-28 17:51:54 -08:00

archivebox_crawl.py

refactor: batch all URLs into single Crawl, update tests

2025-12-30 20:06:56 +00:00

archivebox_extract.py

feat: add schema_version to JSONL outputs and remove dead code

2025-12-30 19:24:53 +00:00

archivebox_help.py

move main funcs into cli files and switch to using click for CLI

2024-11-19 00:18:51 -08:00

archivebox_init.py

use full dotted paths for all archivebox imports, add migrations and more fixes

2025-12-29 00:47:08 -08:00

archivebox_install.py

more migration fixes

2025-12-29 22:12:57 -08:00

archivebox_manage.py

fix archivebox shell and manage CLI commands

2024-11-19 00:48:39 -08:00

archivebox_mcp.py

add mcp server support

2025-12-25 01:51:42 -08:00

archivebox_orchestrator.py

use full dotted paths for all archivebox imports, add migrations and more fixes

2025-12-29 00:47:08 -08:00

archivebox_remove.py

wip

2025-12-28 17:51:54 -08:00

archivebox_schedule.py

wip major changes

2025-12-24 20:10:38 -08:00

archivebox_search.py

wip

2025-12-28 17:51:54 -08:00

archivebox_server.py

much better tests and add page ui

2025-12-29 04:02:11 -08:00

archivebox_shell.py

fix archivebox shell and manage CLI commands

2024-11-19 00:48:39 -08:00

archivebox_snapshot.py

fix: correct CLI pipeline data flow for crawl -> snapshot -> extract

2025-12-30 19:42:41 +00:00

archivebox_status.py

wip

2025-12-28 17:51:54 -08:00

archivebox_update.py

more migration fixes

2025-12-30 09:57:33 -08:00

archivebox_version.py

wip

2025-12-28 17:51:54 -08:00

archivebox_worker.py

much better tests and add page ui

2025-12-29 04:02:11 -08:00

tests_piping.py

refactor: batch all URLs into single Crawl, update tests

2025-12-30 20:06:56 +00:00

tests.py

use full dotted paths for all archivebox imports, add migrations and more fixes

2025-12-29 00:47:08 -08:00