Commit History

Refactor logging in utils and sohu_ccef modules; remove unused test.py file
b38c8d3

gavinzli commited on

Update sohu_ccef.py
003e00d
unverified

OxbridgeEconomics commited on

add exponential backoff logic for persistent network issues
7db09ee

Muhammad Abdur Rahman Saad commited on

refactor logging and streamline content update process
93058c6

gavinzli commited on

update eastmoney.py
f100a8c

Muhammad Abdur Rahman Saad commited on

creation of flask api
ab755b3

Muhammad Abdur Rahman Saad commited on

Handle TimeoutError in crawl function to improve error handling
4a26d9b

gavinzli commited on

Fix retry logic in crawl function to decrement retries correctly
8d15feb

gavinzli commited on

Increase timeout for URL requests in crawl functions to enhance reliability
cc76656

gavinzli commited on

Add timeout parameter to URL requests in crawl functions for improved reliability
fed78ac

gavinzli commited on

Implement retry mechanism for URL requests in crawl function to enhance reliability
9939b16

gavinzli commited on

Add error handling for URL requests in crawl function
94ba329

gavinzli commited on

Handle IncompleteRead exception in crawl function for improved error handling
b1f8d8a

gavinzli commited on

Refactor vectorizer.py by removing commented-out code and improve error handling in stats.py during URL requests
65f1467

gavinzli commited on

Remove openai_vectorize calls from crawl function and retain vectorize for article processing
60a0016

gavinzli commited on

Refactor vectorization process by removing openai_vectorize calls and updating vectorizer initialization
5fea365

gavinzli commited on

Refactor content update process to ensure reference ID is set to None and re-enable vectorization functions in article processing
b68d569

gavinzli commited on

Handle missing reference ID by setting it to None in article data and refactor URL construction for clarity
1c87e0d

gavinzli commited on

Add reference ID extraction and implement retry logic for document addition
693e166

gavinzli commited on

Update article classification from 'site' to 'category' for policy interpretation
9853f17

gavinzli commited on

Add logging for DataFrame output in vectorize function and log content in _crawl function for better traceability
293d18b

gavinzli commited on

Refactor logging in multiple files to replace print statements with logging calls for better traceability
1c5f2e5

gavinzli commited on

Handle exceptions in _crawl function by logging errors and returning None to prevent crashes
61b6281

gavinzli commited on

Implement retry mechanism in _crawl function to handle IncompleteRead exceptions and improve URL fetching reliability
0383f38

gavinzli commited on

Handle IncompleteRead exception in _crawl function to prevent crashes during URL fetching
d710384

gavinzli commited on

Refactor exception handling in multiple files to specify exception types and improve logging
d705151

gavinzli commited on

Refactor error handling and improve logging in utils.py; update vectorization process in vectorizer.py; adjust variable naming in eastmoney.py
c39d841

gavinzli commited on

Update eastmoney.py
9b17ba3
unverified

OxbridgeEconomics commited on

Update csrc.py
122b0c4

Muhammad Abdur Rahman Saad commited on

add more logging statements
82a33ed

Muhammad Abdur Rahman Saad commited on

commit
5737030

OxbridgeEconomics commited on

fix logging issue
4259f95

Muhammad Abdur Rahman Saad commited on

Update csrc.py
91fadcf

Muhammad Abdur Rahman Saad commited on

update flow
5d719e2

Muhammad Abdur Rahman Saad commited on

commit
01677a0

OxbridgeEconomics commited on

commit
270ad28

OxbridgeEconomics commited on

commit
964df08

OxbridgeEconomics commited on

commit
74475ac

OxbridgeEconomics commited on