Additional Ouput Columns for get_news_headlines in Refinitiv Data Library

Hi, I'm running the following code for a group of ISINS:

for isin in ISIN_List:

query = rd.news.get_headlines("ISIN:"+isin+ " and Topic:CVRSY")

query


In the output it seems like the default columns are, "VersionCreated", "headline", "StoryId", and "sourceCode". I was wondering if there was any more info or metadata we could get in additional columns. Some specific asks would be a column for the ISIN associated with the headline, and a way to get all topics associated with the headline. We currently have the topic set to Controversies, but there is a view in the News Monitor itself that shows all topics that the headline is associated with so I'm wondering if I can get that through the API itself. Outside of these 2 just curious in general what other possible data there is

Best Answer

  • Hi @cole

    You can execute the New Headline call at the content layer within the library. For example:

    isin = 'US45766X2053'
    response = news.headlines.Definition(
    query=f"ISIN:{isin} and Topic:CVRSY"
    ).get_data()
    response.data.df

    Doing this, now allows you to get raw data from the platform which exposes metadata within your response. For example:

    response.data.raw

    1707773883181.png

Answers

  • Great, thank you for helping with both of my questions today
  • I'm getting the following error when i try to do this:

    AttributeError                            Traceback (most recent call last)
    Input In [46], in <cell line: 1>()
    ----> 1 response = news.headlines.Definition(
    2 query=f"ISIN: and Topic:MNGISS and Language:LEN",start='2023-09-29T00:00:00',end='2023-09-29T23:59:00', count = 250
    3 ).get_data()
    4 response.data.df

    AttributeError: module 'refinitiv.data._access_layer.news' has no attribute 'headlines'
  • I fixed that first one, I imported news wrong. It does not appear that this is possible with a date range though. This the actual function I'm running now:

    TypeError                                 Traceback (most recent call last)
    Input In [53], in <cell line: 1>()
    ----> 1 response = news.headlines.Definition(
    2 query=f"ISIN: and Topic:MNGISS and Language:LEN",start='2023-09-29T00:00:00',end='2023-09-29T23:59:00', count = 250
    3 ).get_data()
    4 response.data.df

    TypeError: __init__() got an unexpected keyword argument 'start'
  • @cole

    Please run the help command to verify its parameters.

    help(news.headlines.Definition)

    It accepts the following parameters.

     |  date_from: str or timedelta, optional
    | Beginning of date range.
    | String format is: '%Y-%m-%dT%H:%M:%S'. e.g. '2016-01-20T15:04:05'.
    |
    | date_to: str or timedelta, optional
    | End of date range.
    | String format is: '%Y-%m-%dT%H:%M:%S'. e.g. '2016-01-20T15:04:05'.
  • Thank you for explaining
  • Is there any guide or documentation to help with interpreting the metadata? Just trying to understand exactly what I'm seeing
  • Hi @cole ,

    You can check this /data/news/v1/headlines reference guide in the API playground

    1708414817404.png