Commit History
Update sohu_ccef.py
003e00d
unverified
OxbridgeEconomics
committed on
add exponential backoff logic for persistent network issues
7db09ee
Muhammad Abdur Rahman Saad
committed on
Add Data Source [CCEF]
f255707
Add Data Source [CCEF]
a13f43a
refactor logging and streamline content update process
93058c6
update eastmoney.py
f100a8c
Muhammad Abdur Rahman Saad
committed on
creation of flask api
ab755b3
Muhammad Abdur Rahman Saad
committed on
Handle TimeoutError in crawl function to improve error handling
4a26d9b
Fix retry logic in crawl function to decrement retries correctly
8d15feb
Increase timeout for URL requests in crawl functions to enhance reliability
cc76656
Add timeout parameter to URL requests in crawl functions for improved reliability
fed78ac
Implement retry mechanism for URL requests in crawl function to enhance reliability
9939b16
Add error handling for URL requests in crawl function
94ba329
Handle IncompleteRead exception in crawl function for improved error handling
b1f8d8a
Refactor vectorizer.py by removing commented-out code and improve error handling in stats.py during URL requests
65f1467
Remove openai_vectorize calls from crawl function and retain vectorize for article processing
60a0016
Refactor vectorization process by removing openai_vectorize calls and updating vectorizer initialization
5fea365
Refactor content update process to ensure reference ID is set to None and re-enable vectorization functions in article processing
b68d569
Handle missing reference ID by setting it to None in article data and refactor URL construction for clarity
1c87e0d
Add reference ID extraction and implement retry logic for document addition
693e166
Update article classification from 'site' to 'category' for policy interpretation
9853f17
Add logging for DataFrame output in vectorize function and log content in _crawl function for better traceability
293d18b
Refactor logging in multiple files to replace print statements with logging calls for better traceability
1c5f2e5
Handle exceptions in _crawl function by logging errors and returning None to prevent crashes
61b6281
Implement retry mechanism in _crawl function to handle IncompleteRead exceptions and improve URL fetching reliability
0383f38
Handle IncompleteRead exception in _crawl function to prevent crashes during URL fetching
d710384
Refactor exception handling in multiple files to specify exception types and improve logging
d705151
Refactor error handling and improve logging in utils.py; update vectorization process in vectorizer.py; adjust variable naming in eastmoney.py
c39d841
Update eastmoney.py
9b17ba3
unverified
OxbridgeEconomics
committed on
Update csrc.py
122b0c4
Muhammad Abdur Rahman Saad
committed on
add more logging statements
82a33ed
Muhammad Abdur Rahman Saad
committed on
commit
5737030
OxbridgeEconomics
committed on
fix logging issue
4259f95
Muhammad Abdur Rahman Saad
committed on
Update csrc.py
91fadcf
Muhammad Abdur Rahman Saad
committed on
update flow
5d719e2
Muhammad Abdur Rahman Saad
committed on
commit
01677a0
OxbridgeEconomics
committed on
commit
270ad28
OxbridgeEconomics
committed on
commit
964df08
OxbridgeEconomics
committed on
commit
74475ac
OxbridgeEconomics
committed on