Commit Graph

4582 Commits

Author SHA1 Message Date
Nick Sweeting
f6d22a3cc4 tweak worker updated logic and add output_dir_template and symlinks logic 2024-12-13 06:03:52 -08:00
Nick Sweeting
34e4b48557 add example js extractor 2024-12-12 22:15:17 -08:00
Nick Sweeting
74e08a18aa add filestore migrations 2024-12-12 22:15:17 -08:00
Nick Sweeting
c11a1b54f1 add new worker test 2024-12-12 22:08:18 -08:00
Nick Sweeting
5c06b8ff00 add new Event model to workers/models 2024-12-12 22:08:17 -08:00
Nick Sweeting
651ba0b11c add new Process model to Machine models 2024-12-12 21:45:55 -08:00
Nick Sweeting
2a1afcf6c2 move crawl models back into dedicated app 2024-12-12 21:45:55 -08:00
Nick Sweeting
bd5dd2f949 clearer core models separation of concerns using new basemodels 2024-12-12 21:45:53 -08:00
Nick Sweeting
930b9bf386 add archivebox worker cli cmd to list of all cmds 2024-12-12 21:44:44 -08:00
Nick Sweeting
bab26d6a9b better base_models separation of concerns 2024-12-12 21:44:43 -08:00
Nick Sweeting
51447b9d0a bump django version to 5.1.4 2024-12-12 21:42:15 -08:00
Nick Sweeting
6b3e297db8 fix lock_pkgs.sh version parsing and python version 2024-12-12 21:41:45 -08:00
Nick Sweeting
5cf7725f0e add new archivebox worker implementation based on better distributed systems principles 2024-12-12 21:41:45 -08:00
Nick Sweeting
a859278a63 tags apps.py 2024-12-12 21:41:45 -08:00
Nick Sweeting
1444cf7fda add new KVTags system 2024-12-12 21:41:44 -08:00
Nick Sweeting
81bf81ab10 add extract.js prototype extractor 2024-12-06 02:06:40 -08:00
Nick Sweeting
ac53fdf677 make chrome binary and configs directly runnable and make extractor use external bin 2024-12-06 02:06:39 -08:00
Nick Sweeting
a572db307b fix syntax errors (#1609) 2024-12-05 19:36:37 -05:00
dish
f1b9aec873 fix syntax errors 2024-12-05 13:52:33 -05:00
Nick Sweeting
d192eb5c48 add filestore content addressible store draft 2024-12-04 02:15:04 -08:00
Nick Sweeting
dc0f1b0efc add new File model in filestore 2024-12-04 02:15:04 -08:00
Nick Sweeting
a3fe78afaa add basename to hashing get_dir_info 2024-12-04 02:15:04 -08:00
Nick Sweeting
73a75bb4c9 Update FUNDING.yml 2024-12-04 01:38:07 -08:00
Nick Sweeting
8c8ec6aff0 add extractors README 2024-12-03 02:15:17 -08:00
Nick Sweeting
dcd7e2555e add new archivebox_extract cli command 2024-12-03 02:14:56 -08:00
Nick Sweeting
337acdac9c add base extractor class 2024-12-03 02:14:42 -08:00
Nick Sweeting
1ceaa1ac7a add ABID model check and fix model inheritance 2024-12-03 02:14:21 -08:00
Nick Sweeting
c374d7695e allow getting crawl from API as rss feed 2024-12-03 02:13:45 -08:00
Nick Sweeting
eae7ed8447 add hashing misc library for merkle tree generation 2024-12-03 02:12:20 -08:00
Nick Sweeting
22901406aa Update 2-feature_request.yml 2024-11-22 18:29:58 -05:00
Nick Sweeting
44d337a167 convert index.schema.ArchiveResult and Link to pydantic 2024-11-19 06:32:48 -08:00
Nick Sweeting
b948e49013 add urls log to Crawl model 2024-11-19 06:32:33 -08:00
Nick Sweeting
28386ff172 add jobs_dashboard.html back 2024-11-19 05:35:52 -08:00
Nick Sweeting
4dd53dc12a Merge branch 'newchanges' into dev 2024-11-19 05:28:20 -08:00
Nick Sweeting
b852951c58 fix cli loading edge case where setup_django wasnt running when it should 2024-11-19 05:27:35 -08:00
Nick Sweeting
6b47510f70 always pre-setup binproviders 2024-11-19 05:24:12 -08:00
Nick Sweeting
f8e2f7c753 restore missing archivebox_update work 2024-11-19 05:09:19 -08:00
Nick Sweeting
52446b86ba restore missing archivebox_status work 2024-11-19 05:08:41 -08:00
Nick Sweeting
0f536ff18b restore missing archivebox_schedule work 2024-11-19 05:07:55 -08:00
Nick Sweeting
fe3320eff0 restore missing archivebox_remove work 2024-11-19 05:07:12 -08:00
Nick Sweeting
230bf34e14 restore missing archivebox_config work 2024-11-19 05:05:06 -08:00
Nick Sweeting
ee548eb16e fix archivebox install not using LIB_DIR 2024-11-19 04:44:43 -08:00
Nick Sweeting
6740202d78 fix cli loading edge case where setup_django wasnt running when it should 2024-11-19 04:20:00 -08:00
Nick Sweeting
f21b86aba8 better cli colors 2024-11-19 04:10:07 -08:00
Nick Sweeting
0f860d40f1 working archivebox_status CLI cmd 2024-11-19 04:05:05 -08:00
Nick Sweeting
292730ebad working archivebox_schedule cmd 2024-11-19 03:54:47 -08:00
Nick Sweeting
3a64ced697 fix archivebox delete errors 2024-11-19 03:45:44 -08:00
Nick Sweeting
0347b911aa archivebox add and remove CLI cmds 2024-11-19 03:40:01 -08:00
Nick Sweeting
2595139180 improve statemachine logging and archivebox update CLI cmd 2024-11-19 03:31:05 -08:00
Nick Sweeting
c9a05c9d94 working archivebox update CLI cmd 2024-11-19 02:32:05 -08:00