Filtering ECB-Related News from News Archive in Pandas DataFrame
Message:
Hello,
I am currently working on filtering a large dataset containing Reuters news from the past 20 years, which I've loaded into a Pandas DataFrame. The dataset has over 2 million news records, sourced monthly from an SFTP server. My objective is to isolate news items related to the European Central Bank (ECB) and their interest rate decisions.
Details:
- Hostname: archive.news.refinitiv.com
- Username: GE-A-01103867-3-15059
- Filepath: /News/RTRS/Monthly/
- Data Format: JSON
I am specifically looking to filter out articles tagged with "ECB" or "M:I" in the data.subjects
column, but exclude any tagged with "ECB/INT".
Current Method:My current approach uses the following Pandas code snippet:
df_clean = df[ (df['data.subjects'].str.contains('M:I|ECB', na=False)) & (~df['data.subjects'].str.contains('ECB/INT', na=False)) ]
Issue:Despite these tags being standard and correctly formatted according to the official guide, the filter returns an empty DataFrame, and there are no error messages that indicate what might be wrong.
Questions:
- Is there an error in how I'm applying the filter conditions?
- Could there be an unseen issue with how the DataFrame is structured or how the data is being read into Pandas?
Any assistance in adjusting the code or troubleshooting this issue would be greatly appreciated.
Thank you!
Best Answer
-
Thank you for reaching out to us.
From the question, I assume that you are using the News Archive product.
This forum is dedicated to software developers using Refinitiv APIs. The moderators on this forum do not have deep expertise in every bit of content available through Refinitiv products.
Please kindly contact the product support team directly via MyRefinitiv. The support team can verify and answer this question.
0
Categories
- All Categories
- 6 AHS
- 37 Alpha
- 161 App Studio
- 4 Block Chain
- 4 Bot Platform
- 16 Connected Risk APIs
- 47 Data Fusion
- 30 Data Model Discovery
- 608 Datastream
- 1.3K DSS
- 577 Eikon COM
- 4.9K Eikon Data APIs
- 7 Electronic Trading
- Generic FIX
- 7 Local Bank Node API
- Trading API
- 2.7K Elektron
- 1.3K EMA
- 236 ETA
- 519 WebSocket API
- 33 FX Venues
- 10 FX Market Data
- 1 FX Post Trade
- 1 FX Trading - Matching
- 12 FX Trading – RFQ Maker
- 5 Intelligent Tagging
- 2 Legal One
- 20 Messenger Bot
- 2 Messenger Side by Side
- 9 ONESOURCE
- 7 Indirect Tax
- 59 Open Calais
- 264 Open PermID
- 39 Entity Search
- 2 Org ID
- PAM
- PAM - Logging
- 8.4K Private Comments
- 6 Product Insight
- Project Tracking
- ProView
- ProView Internal
- 20 RDMS
- 1.4K Refinitiv Data Platform
- 367 Refinitiv Data Platform Libraries
- 3 Refinitiv Due Diligence
- LSEG Due Diligence Portal API
- 3 Refinitiv Due Dilligence Centre
- Rose's Space
- 1.1K Screening
- 18 Qual-ID API
- 13 Screening Deployed
- 23 Screening Online
- 10 World-Check Customer Risk Screener
- 990 World-Check One
- 44 World-Check One Zero Footprint
- 45 Side by Side Integration API
- Test Space
- 3 Thomson One Smart
- 1.2K TR Internal
- Global Hackathon 2015
- 2 Specialists Who Code
- 10 TR Knowledge Graph
- 150 Transactions
- 142 REDI API
- 1.7K TREP APIs
- 4 CAT
- 21 DACS Station
- 117 Open DACS
- 1.1K RFA
- 103 UPA
- 172 TREP Infrastructure
- 224 TRKD
- 886 TRTH
- 5 Velocity Analytics
- 5 Wealth Management Web Services
- 59 Workspace SDK
- 9 Element Framework
- 5 Grid
- 13 World-Check Data File
- Yield Book Analytics
- 46 中文论坛