document_redaction / tools /find_duplicate_pages.py

Commit History

Major update. General code revision. Improved config variables. Dataframe based review frame now includes text, items can be searched and excluded. Costs now estimated. Option for adding cost codes added. Option to extract text only.
0ea8b9e

seanpedrickcase commited on

More config options. Fixed some bugs with removing elements from review page and Adobe export. Some UI rearrangements
6319afc

seanpedrickcase commited on

Laid groundwork for passing in AWS API keys. Duplicate pages option should now work for pages with no text.
7907ad4

seanpedrickcase commited on

Added tab to be able to compare pages across multiple documents and redact duplicates
a265560

seanpedrickcase commited on