Commit History

fix NotValidLengthException
497072d

Muhammad Abdur Rahman Saad commited on

refactor logging and streamline content update process
93058c6

gavinzli commited on

creation of flask api
ab755b3

Muhammad Abdur Rahman Saad commited on

fix(eastmoney)
32cebdb

Muhammad Abdur Rahman Saad commited on

Return empty string on translation failure in translate function
b6a6edc

gavinzli commited on

Update utils.py
3c227dd
unverified

OxbridgeEconomics commited on

Increase timeout for URL requests in crawl functions to enhance reliability
cc76656

gavinzli commited on

Add timeout parameter to URL requests in crawl functions for improved reliability
fed78ac

gavinzli commited on

Handle LangDetectException in crawl_by_url function to improve error handling
48adbee

gavinzli commited on

Refactor vectorization process by removing openai_vectorize calls and updating vectorizer initialization
5fea365

gavinzli commited on

Add validation for content length and enhance error handling in crawl_by_url function
29d3eca

gavinzli commited on

Merge branch 'main' of https://github.com/oxbridge-econ/data-collection-china
b4bd94d

gavinzli commited on

Add handling for DependencyError in PDF extraction and update requirements to include pycryptodome
beed350

gavinzli commited on

Update utils.py
c9d52fa
unverified

OxbridgeEconomics commited on

Refactor content update process to ensure reference ID is set to None and re-enable vectorization functions in article processing
b68d569

gavinzli commited on

Fix table name casing in update_content function for DynamoDB
2512706

gavinzli commited on

Add reference ID extraction and implement retry logic for document addition
693e166

gavinzli commited on

Increase retry attempts and adjust sleep duration for translation requests
1269de7

gavinzli commited on

Refactor translation error handling and remove debug print statements in vectorization
0750507

gavinzli commited on

Implement retry logic for translation requests to handle RequestError exceptions
c664824

gavinzli commited on

Replace logging with print statements for content update and reference extraction functions
f237a77

gavinzli commited on

Add logging configuration and info statements for content updates and reference extraction
fbf8f15

gavinzli commited on

Limit content length to 500 characters in sentiment computation for improved analysis accuracy
dcdb6e8

gavinzli commited on

Refactor exception handling in multiple files to specify exception types and improve logging
d705151

gavinzli commited on

Refactor error handling and improve logging in utils.py; update vectorization process in vectorizer.py; adjust variable naming in eastmoney.py
c39d841

gavinzli commited on

commit
5737030

OxbridgeEconomics commited on

commit
4e18ce3

OxbridgeEconomics commited on

commit
0e15728

OxbridgeEconomics commited on

commit
01677a0

OxbridgeEconomics commited on

commit
270ad28

OxbridgeEconomics commited on

commit
964df08

OxbridgeEconomics commited on

commit
8467896

OxbridgeEconomics commited on

commit
74475ac

OxbridgeEconomics commited on