How to remove formatting characters from the news body text when retrieving real-time news data using
Hi, a customer called me with this question: how can the invisible formatting characters scattered among the printable words be removed? Sometimes the reorganized news story body looks like a mess because of those special formatting characters. How can the words of the news body be typeset in order?
Best Answer
Hi @Liheng.Wang,
Can you elaborate on what you mean by "invisible formatting characters"? The format of the story body text is given by the mimeType defined within the JSON data structure: plain text.
Stories do contain <CR><LF> (carriage return/line feed) sequences, which are used for display terminals. In addition, stories can be natively represented in other language variants. Can you also elaborate on what you mean by "typeset those news body text words in order"? Do you mean you want to filter out certain ASCII characters such as <CR>, <TAB>, <LF>, etc.? If so, you will need to parse the body of the text and apply your own filtering.
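As a rough illustration of that kind of filtering, here is a minimal Python sketch. It assumes the story body has already been extracted from the JSON structure into a string; the function name `strip_control_chars` and the choice to keep line breaks while replacing tabs with spaces are my own assumptions, not anything defined by the feed or the API.

```python
import re

# C0 control characters and DEL, excluding tab (\x09) and line feed (\x0a),
# which are handled explicitly below.
_CONTROL_CHARS = re.compile(r"[\x00-\x08\x0b-\x1f\x7f]")


def strip_control_chars(body: str) -> str:
    """Normalize line endings, then drop the remaining ASCII control characters."""
    # Turn <CR><LF> and bare <CR> into a single <LF> so line breaks survive.
    text = body.replace("\r\n", "\n").replace("\r", "\n")
    # Replace each tab with a space so adjacent words do not run together.
    text = text.replace("\t", " ")
    # Remove every other control character.
    return _CONTROL_CHARS.sub("", text)


if __name__ == "__main__":
    sample = "Line one\r\nLine\ttwo\x00"
    print(repr(strip_control_chars(sample)))
```

If the customer would rather drop line breaks as well, the `<LF>` characters can be replaced with spaces in the same way as the tabs.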
Answers
nick.zincone.1, thank you so much for the answer. Yes, that is exactly what I mean. The customer said those ASCII characters are not good enough for laying out the news body, and sometimes they make the news look like a mess. So they want to remove all these characters and then reconstruct the layout on their own from the plain text. So the question is: how can all these characters be identified and filtered out? I believe their problem is how to turn the news body into purely plain text.
Hello @Liheng.Wang
Those control characters come from the MRN data feed, not from the API or TREP. They are part of the news story content, and the API just sends it to the client application "as is". If the client wants to remove those ASCII characters, the client needs to implement their own filter to detect and remove them at the application level.
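For what it is worth, an application-level filter could start by detecting which control characters a story actually contains and then re-flow the text. The Python sketch below is only an illustration under my own assumptions: the helper names `control_char_inventory` and `reflow_paragraphs` are hypothetical, the body is assumed to be a string already decoded by the application, and treating a blank line as a paragraph break is a guess about the layout, not a documented rule of the feed.

```python
import re
import unicodedata
from collections import Counter


def control_char_inventory(body: str) -> Counter:
    """Count each control character (Unicode category Cc) present in the body."""
    return Counter(
        f"U+{ord(ch):04X}" for ch in body if unicodedata.category(ch) == "Cc"
    )


def reflow_paragraphs(body: str) -> str:
    """Join hard-wrapped lines into paragraphs separated by one blank line."""
    # Normalize <CR><LF> and bare <CR> to <LF>.
    text = body.replace("\r\n", "\n").replace("\r", "\n")
    # Assumption: one or more blank lines mark a paragraph boundary.
    paragraphs = re.split(r"\n\s*\n", text)
    # Collapse tabs, line breaks and repeated spaces inside each paragraph.
    cleaned = (" ".join(p.split()) for p in paragraphs)
    return "\n\n".join(p for p in cleaned if p)


if __name__ == "__main__":
    story = "First line\r\n  continues\r\n\r\nSecond\tpara\r\n"
    print(control_char_inventory(story))
    print(reflow_paragraphs(story))
```

Running the inventory over a few real stories first shows which characters actually occur, so the filter can be limited to those rather than guessing.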