Story image

Taking a look at the growing trend of text analytics

23 May 2018

Article written by Mathworks application manager for data analytics Seth DeLand.

Text analytics can help engineers who are performing sophisticated numerical analysis identify groups of ideas and concepts that can lead to better outcomes. The challenge with the process, however, is that the sheer volume of unstructured, raw text data sets can make it difficult for analytics tools to quickly and intuitively extract all the valuable information that may be available to the user.

For engineers in industries such as automotive manufacturing, aerospace design, industrial automation, and machinery, or energy distribution, choosing the right tools is an important early step to efficiently pulling insights from raw text. I’m going to explain how engineers can extract more value from raw text and combine that with sensor data and machine learning algorithms to improve functions like predictive maintenance.

What kind of text data is important to engineers and why? 

One of the major areas where we see text being used is in gathering and analysing data from automotive maintenance reports. For example, these maintenance reports include information from the vehicles that can prove valuable to automotive engineers. There's text in those reports from mechanics about the vehicle's service history.

At the individual level, a maintenance record describes what happened at that particular service visit. But, if automotive engineers can quickly and easily aggregate all of those reports, then there's a lot of real-world information that can be deciphered from the correlations. For example, automotive engineers could learn the vehicle's common service issues or, from a warranty perspective, understand key failures in the car that happen simultaneously. 

Topic modeling applied to mechanics notes identifies the key reasons for performing maintenance (Copyright:  1984–2018 The MathWorks, Inc.)

On the other hand, many of today's maintenance logs are digitised and generated automatically. In the industrial automation and machinery space, this could mean that -- during the operation of heavy equipment -- the text from these digitised maintenance logs could be analysed so that error messages or warnings are sent to operators prior to failure, thereby avoiding production having to be stopped.

Finally, Advanced Driver Assistance Systems (ADAS) in the automotive industry is a growing area for text analytics. When a car's camera captures images from road signs, those images need to be interpreted. Text analytics is a way to not only build models to read road signs but also to interpret the meaning of the text on those signs.

What other things are customers exploring when it comes to text analytics and maintenance?

Predictive maintenance is an area that could directly benefit from text analytics. We already talked about how being able to easily generate insights from raw text data from maintenance records can provide benefits; however, this raw text data can also help engineers build algorithms to predict failures before any warnings are sent. For example, in the off-highway commercial space, if a piece of heavy equipment breaks down, that becomes a costly failure.

We have customers producing heavy equipment that is going to be used on a construction site. When that piece of equipment fails, it results in more cost and time since the construction is stalled. For engineers in the industrial, automation, and machinery space, to be able to build algorithms that can predict these failures before they occur will prevent delays, thereby saving time and money.

One example of this would be to use text analytics to analyse maintenance logs and come up with categories of failures. These categories could be thought of as "labels" that identify the type of failure that occurred. By combining these labels with the raw sensor data from the equipment when it was in operation, an engineer could train a supervised machine learning model. That model could then be used on new sensor data to predict future failures.

Check Point announces integration with Microsoft Azure
The integration of Check Point’s advanced policy enforcement capabilities with Microsoft AIP’s file classification and protection features enables enterprises to keep their business data and IP secure, irrespective of how it is shared. 
Why AI will be procurement’s greatest ally
"AI can help identify emerging suppliers, technologies and products in specific categories."
Are AI assistants teaching girls to be servants?
Have you ever interacted with a virtual assistant that has a female-based voice or look, and wondered whether there are implicitly harmful gender biases built into its code?
Google 'will do better' after G Suite passwords exposed since 2005
Fourteen years is a long time for sensitive information like usernames and passwords to be sitting ducks, unencrypted and at risk of theft and corruption.
Hackbusters! Reviewing 90 days of cybersecurity incident response cases
While there are occasionally very advanced new threats, these are massively outnumbered by common-or-garden email fraud, ransomware attacks and well-worn old exploits.
Data#3 to exclusively provide MS licences to WA Government
The technology services provider has won two contracts with the Western Australia Government, becoming its sole Microsoft licence provider.
Why cash is no longer king in Australia
Australia is leading the way in APAC for granting credit on B2B transactions.
Informatics deepens integration with Google Cloud
The data management company has connected its solutions with Google Cloud’s big data analytics solutions.