Have you considered adding a way to re-duplicate (un-deduplicate) a WARC file, or a set of WARC files? Or is that already possible?
For example, suppose you want to export all the data in a subset of WARC files taken from a larger set, plus all the data they reference through revisit records. This could work in one of two ways: either by replacing the revisit records with the actual records, or by also exporting the revisit-referenced files, reduced to only the records that were actually referenced.
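To make the two options concrete, here is a minimal sketch in plain Python. It does not use this project's API. `Record`, `reduplicate` and `referenced_records` are hypothetical names invented for illustration, and the sketch assumes every revisit record carries a `WARC-Refers-To` header pointing at the `WARC-Record-ID` of the original record. It also ignores the HTTP headers that a real revisit record stores for its own capture.

```python
from dataclasses import dataclass, replace
from typing import Dict, List, Optional, Set


@dataclass(frozen=True)
class Record:
    """Hypothetical in-memory stand-in for a parsed WARC record."""
    record_id: str                    # WARC-Record-ID
    warc_type: str                    # WARC-Type, e.g. "response" or "revisit"
    target_uri: str                   # WARC-Target-URI
    date: str                         # WARC-Date
    payload: bytes = b""
    refers_to: Optional[str] = None   # WARC-Refers-To (revisit records only)


def reduplicate(selected: List[Record], archive: List[Record]) -> List[Record]:
    """Option 1: replace each revisit record with a full copy of its original."""
    by_id: Dict[str, Record] = {r.record_id: r for r in archive}
    out: List[Record] = []
    for rec in selected:
        original = by_id.get(rec.refers_to) if rec.refers_to else None
        if rec.warc_type != "revisit" or original is None:
            # Not a revisit, or the original is missing: keep the record as-is.
            out.append(rec)
            continue
        # Keep the revisit's own ID, URI and date; take type and payload
        # from the record it refers to.
        out.append(replace(rec, warc_type=original.warc_type,
                           payload=original.payload, refers_to=None))
    return out


def referenced_records(selected: List[Record], archive: List[Record]) -> List[Record]:
    """Option 2: list the originals that must be exported alongside `selected`."""
    by_id: Dict[str, Record] = {r.record_id: r for r in archive}
    have: Set[str] = {r.record_id for r in selected}
    wanted: List[Record] = []
    for rec in selected:
        target = by_id.get(rec.refers_to) if rec.refers_to else None
        if target is not None and target.record_id not in have:
            have.add(target.record_id)  # avoid exporting the same original twice
            wanted.append(target)
    return wanted
```

Option 1 yields self-contained output at the cost of storing duplicate payloads again. Option 2 keeps the deduplication but requires shipping the extra records together with the export.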