{"id":238,"date":"2018-02-12T12:19:46","date_gmt":"2018-02-12T12:19:46","guid":{"rendered":"https:\/\/wordpress.www.revuze.it\/?p=238"},"modified":"2018-02-12T12:19:46","modified_gmt":"2018-02-12T12:19:46","slug":"why-text-analytics-and-nlp-are-not-the-answer","status":"publish","type":"post","link":"https:\/\/www.revuze.it\/blog\/why-text-analytics-and-nlp-are-not-the-answer\/","title":{"rendered":"Why text analytics and NLP are NOT the answer?"},"content":{"rendered":"<div class=\"entry-content\">\n<p>With <a href=\"https:\/\/www.revuze.it\/blog\/90-of-the-data-was-created-recently\/\" target=\"_blank\" rel=\"noopener noreferrer\">90% of the world\u2019s data created in the last 2 years<\/a> the world data is growing at a scary pace\u2026there are so many ways now for consumers to share data and information that organizations everywhere need to analyze and deal with textual data. Obvious examples are customer service (returns, complaints), QA (failures, missing parts, packaging), product (popular features, negative reviews, competitive analysis) and market research (analyzing brands, products and sentiment).<\/p>\n<p>With so much text to look into it just make sense to leverage technology to help you slice it into buckets and areas of interest. This is where Text Analytics and Natural Language Processing (NLP) come it.<\/p>\n<p><strong>\u00a0<\/strong><\/p>\n<p><strong>So what are Text Analytics or NLP?<\/strong><\/p>\n<p>Text analytics (Sometimes referred to as\u00a0text data mining) is the process of deriving high quality\u00a0information\u00a0from\u00a0text. This is typically achieved through finding patterns and trends by means such as\u00a0statistical pattern learning. Text analytics usually involves structuring the input text (usually parsing, along with the addition of some derived linguistic features and the removal of others, and inserting into a\u00a0database), deriving patterns within the\u00a0structured data, and finally evaluation and interpretation to make meaningful observations. Text analytics typically doesn\u2019t involve the semantics in the text and is more about text patterns discovery.<\/p>\n<p>NLP is a component of text analytics that performs a special kind of linguistic analysis that helps a machine \u201cread\u201d text. NLP is about understanding Natural Language, as Natural language is what humans use for communication. The data could be speech or text and as such the main goal is to understand what is the semantic meaning of it.<\/p>\n<p>NLP and text analytics are complimentary, where typically text-mining uses NLP, because it makes sense to mine the data when you understand the data semantically<\/p>\n<p><strong>\u00a0<\/strong><\/p>\n<p><strong>How does NLP work?<\/strong><\/p>\n<p>First, the computer must understand what each word is. It tries to understand if it\u2019s a noun or a verb, if it\u2019s past or present tense, and so on. This is called Part-of-Speech tagging (POS).<\/p>\n<p>NLP systems also have a vocabulary and a set of grammar rules coded into the system. Modern NLP algorithms use statistical machine learning to apply these rules to the natural language and determine the most likely meaning behind what was said.<\/p>\n<p>The end goal is to have the computer understand the meaning of what was said\/written. This is challenging as some words may have several meanings (polysemy) or different words having similar meanings (synonymy), but developers encode rules into their systems and train them to learn to apply the rules correctly.<\/p>\n<p><strong>\u00a0<\/strong><\/p>\n<p><strong>So where is the problem?<\/strong><\/p>\n<p>The short answer is humans training NLP systems to \u201cread\u201d natural language. They put in a vocabulary and set of rules for the software to look for these words as a way to figure out meaning. The problem is that language is constantly evolving, and younger people create new ways of expressing yourself around a topic that didn\u2019t exist before. How can you train a machine to look for something that doesn\u2019t exist yet? Obviously once you realized that there is a new way to talk about a topic you now need to bring back the experts to train the system again to recognize the new keywords, which is time consuming and likely costly. At the end what it means is that you missed the bus\u2026by the time you realize there is a new way to talk about something that is important to you likely the train had left the station and you missed the<\/p>\n<p>meaning of this.<\/p>\n<p>Lets pick and example. Lets say we\u2019re a smartphone brand and want to analyze what consumers are saying about our latest phone\u2019s battery life. We can try to scan online reviews and search for variations of the word \u201cBattery\u201d, but what happens if consumers are using phrases such as \u201cdoesn\u2019t last long enough\u201d or \u201cphone died on me in the middle of the work day\u201d?<\/p>\n<p><strong>\u00a0<\/strong><\/p>\n<p><strong>What\u2019s the right way to do things?<\/strong><\/p>\n<p>With Artificial Intelligence and Self Training algorithms you can skip the person-training-machine steps which limit the scope of the machine understanding and is also slow in terms of response time and skip directly to a machine-training-machine scenario, growing to unlimited scale and immediate response to any variation of a meaning.<\/p>\n<p><strong>Conclusion<\/strong><\/p>\n<p>Current NLP technologies rely on humans and thus are slow to setup, miss a lot of the meaning<\/p>\n<p>in texts and are slow to adapt. In a world where <a href=\"https:\/\/www.revuze.it\/blog\/90-of-the-data-was-created-recently\/\" target=\"_blank\" rel=\"noopener noreferrer\">90% of the world\u2019s data created in the last 2 years<\/a> you can\u2019t rely on humans or manual labor to figure things out.<\/p>\n<p>The good news is that now there is enough data to make sure you can get answers to your questions, and all you need is just to analyze the data. Revuze is an innovative technology vendor that addresses just this with the first self-training, fast setup and low touch solution that typically delivers 5-8X the data coverage compared to anything else, and it does it without humans\u2026<\/p>\n<\/div>\n","protected":false},"excerpt":{"rendered":"<p>With 90% of the world\u2019s data created in the last 2 years the world data is growing at a scary pace\u2026there are so many ways now<\/p>\n","protected":false},"author":34,"featured_media":239,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"_acf_changed":false,"inline_featured_image":false,"footnotes":""},"categories":[1],"tags":[],"acf":[],"_links":{"self":[{"href":"https:\/\/www.revuze.it\/blog\/wp-json\/wp\/v2\/posts\/238"}],"collection":[{"href":"https:\/\/www.revuze.it\/blog\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.revuze.it\/blog\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.revuze.it\/blog\/wp-json\/wp\/v2\/users\/34"}],"replies":[{"embeddable":true,"href":"https:\/\/www.revuze.it\/blog\/wp-json\/wp\/v2\/comments?post=238"}],"version-history":[{"count":0,"href":"https:\/\/www.revuze.it\/blog\/wp-json\/wp\/v2\/posts\/238\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/www.revuze.it\/blog\/wp-json\/wp\/v2\/media\/239"}],"wp:attachment":[{"href":"https:\/\/www.revuze.it\/blog\/wp-json\/wp\/v2\/media?parent=238"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.revuze.it\/blog\/wp-json\/wp\/v2\/categories?post=238"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.revuze.it\/blog\/wp-json\/wp\/v2\/tags?post=238"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}