{"id":17376,"date":"2015-09-18T12:54:13","date_gmt":"2015-09-18T12:54:13","guid":{"rendered":"http:\/\/www.businesscloudnews.com\/?p=235091"},"modified":"2015-09-18T12:54:13","modified_gmt":"2015-09-18T12:54:13","slug":"semantic-technology-is-it-the-next-big-thing-or-just-another-buzzword","status":"publish","type":"post","link":"https:\/\/icloud.pe\/blog\/semantic-technology-is-it-the-next-big-thing-or-just-another-buzzword\/","title":{"rendered":"Semantic technology: is it the next big thing or just another buzzword?"},"content":{"rendered":"<p>Most buzzwords circulating right now describe very attention-grabbing products: virtual reality headsets, smart watches, internet-connected toasters. Big Data is the prime example of this: many firms are marketing themselves to be associated with this term and its technologies while it\u2019s \u2018of the moment\u2019, but are they really innovating or simply adding some marketing hype to their existing technology? Just how \u2018big\u2019 is their Big Data?<\/p>\n<p>On the surface of it one would expect semantic technology to face similar problems, however the underlying technology requires a much more subtle approach. The technology is at its best when it\u2019s transparent, built into a set of tools to analyse, categorise and retrieve content and data before it\u2019s even displayed to the end user. While this means it may not experience as much short term media buzz, it is profoundly changing the way we use the internet and interact with content and data.<\/p>\n<p>This is much bigger than Big Data. But what is semantic technology? Broadly speaking, semantic technologies encode meaning into content and data to enable a computer system to possess human-like understanding and reasoning. There are a number of different approaches to semantic technology, but for the purposes of this article we\u2019ll focus \u2018Linked Data\u2019. In general terms this means creating links between data points <em>within<\/em> documents and other forms of data containers, rather than the documents themselves. It is in many ways similar what Tim Berners-Lee did in creating the standards by which we link documents, just on a more granular scale.<\/p>\n<p>Existing text analysis techniques can identify entities within documents. For example, in the sentence \u201cHaruhiko Kuroda, governor of Bank of Japan, announced 0.1 percent growth,\u201d \u2018Haruhiko Kuroda\u2019 and \u2018Bank of Japan\u2019 are both entities, and they are \u2018tagged\u2019 as such using specialised markup language. These tags are simply a way of highlighting that the text has some significance; it remains with the human user to understand what the tags mean.<\/p>\n<p>&nbsp;<\/p>\n<p><a href=\"http:\/\/www.businesscloudnews.com\/files\/2015\/09\/1-tagging.jpg\"><img loading=\"lazy\" decoding=\"async\" class=\"aligncenter wp-image-235101 size-full\" src=\"http:\/\/www.businesscloudnews.com\/files\/2015\/09\/1-tagging.jpg\" alt=\"1 tagging\" width=\"610\" height=\"98\" \/><\/a>Once tagged, entities can then be recognised and have information from various sources associated with them. Groundbreaking? Not really. It\u2019s easy to tag content such that the system knows that \u201cHaruhiko Kuroda\u201d is a type of \u2018person\u2019, however this still requires human input.<\/p>\n<p><a href=\"http:\/\/www.businesscloudnews.com\/files\/2015\/09\/2-named-entity-recognition.jpg\"><img loading=\"lazy\" decoding=\"async\" class=\"aligncenter size-full wp-image-235111\" src=\"http:\/\/www.businesscloudnews.com\/files\/2015\/09\/2-named-entity-recognition.jpg\" alt=\"2 named entity recognition\" width=\"610\" height=\"261\" \/><\/a><\/p>\n<p>Where semantics gets more interesting is in the representation and analysis of the <em>relationships<\/em> between these entities. Using the same example, the system is able to create a formal, machine-readable relationship between Haruhiko Kuroda, his role as the governor, and the Bank of Japan.<\/p>\n<p><a href=\"http:\/\/www.businesscloudnews.com\/files\/2015\/09\/3-relation-extraction.jpg\"><img loading=\"lazy\" decoding=\"async\" class=\"aligncenter size-full wp-image-235121\" src=\"http:\/\/www.businesscloudnews.com\/files\/2015\/09\/3-relation-extraction.jpg\" alt=\"3 relation extraction\" width=\"610\" height=\"210\" \/><\/a><\/p>\n<p>In order for this to happen, the pre-existing environment must be defined. In order for the system to understand that \u2018governor\u2019 is a \u2018job\u2019 which exists within the entity of \u2018Bank of Japan\u2019, a rule must exist which states this as an abstraction. This is called an ontology.<\/p>\n<p>Think of an ontology as the rule-book: it describes the world in which the source material exists. If semantic technology was used in the context of pharmaceuticals, the ontology would be full of information about classifications of diseases, disorders, body systems and their relationships to each other. If the same technology was used in the context of the football World Cup, the ontology would contain information about footballers, managers, teams and the relationships between those entities.<\/p>\n<p>What happens when we put this all together? We can begin to infer relationships between entities in a system that have not been directly linked by human action.<\/p>\n<p><a href=\"http:\/\/www.businesscloudnews.com\/files\/2015\/09\/4-inference.jpg\"><img loading=\"lazy\" decoding=\"async\" class=\"aligncenter size-full wp-image-235131\" src=\"http:\/\/www.businesscloudnews.com\/files\/2015\/09\/4-inference.jpg\" alt=\"4 inference\" width=\"610\" height=\"208\" \/><\/a><\/p>\n<p>An example: a visitor arrives on the website of a newspaper and would like information about bank governors in Asia. Semantic technology allows the website to return a much more sophisticated set of results from the initial search query. Because the system has an understanding of the relationships defining bank governors generally (via the the ontology), it is able to leverage the entire database of published text content in a more sophisticated way, capturing relationships that would have been overlooked by computer analysis alone. The result is that the user is provided with content more closely aligned to what they are already reading.<\/p>\n<p>Read the sentence and answer the question: \u201cWhat is a \u2018Haruhiko Kuroda\u2019?\u201d As a human the answer is obvious. He is several things: human, male, and a governor of the Bank of Japan. This is the type of analytical thought process, this ability to assign traits to entities and then use these traits to infer relationships between new entities, that has so far eluded computer systems. The technology allows the inference of relationships that are not specifically stated within the source material: because the system knows that Haruhiko Kuroda is governor of Bank of Japan, it is able to infer that he works with other employees of the Bank of Japan, that he lives in Tokyo, which is in Japan, which is a set of islands in the Pacific.<\/p>\n<p>Companies such as the BBC, which Ontotext has worked with, are sitting on more text data than they have ever experienced before. This is hardly unique to the publishing industry, either. According to Eric Schmidt, former Google CEO and executive chairman of Alphabet, every two days we create as much information as was generated from the dawn of civilisation up until 2003 &#8211; and <a href=\"http:\/\/techcrunch.com\/2010\/08\/04\/schmidt-data\/\">he said that in 2010<\/a>. Five years later and businesses of all sizes are waking up to this fact &#8211; they must invest in the infrastructure to fully take advantage of their own data. You may not be aware of it, but you are already using semantic technology every day. Take Google search as an example: when you input a search term, for example \u2018Bulgaria\u2019, two columns appear. On the left are the actual search results, and on the right are semantic search results: information about the country\u2019s flag, capital, currency and other information that is pulled from various sources based on semantic inference.<\/p>\n<p><em><strong>Written by\u00a0Jarred McGinnis, UK managing consultant at <em>Ontotext<\/em><\/strong><\/em><\/p>\n","protected":false},"excerpt":{"rendered":"<p>Most buzzwords circulating right now describe very attention-grabbing products: virtual reality headsets, smart watches, internet-connected toasters. Big Data is the prime example of this: but how &lsquo;big&rsquo; is enterprise Big Data?<\/p>\n","protected":false},"author":7,"featured_media":0,"comment_status":"closed","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[156,656,3424,1297,3425],"tags":[],"class_list":["post-17376","post","type-post","status-publish","format-standard","hentry","category-big-data","category-database","category-linked-data","category-opinion","category-semantic-technology"],"_links":{"self":[{"href":"https:\/\/icloud.pe\/blog\/wp-json\/wp\/v2\/posts\/17376","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/icloud.pe\/blog\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/icloud.pe\/blog\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/icloud.pe\/blog\/wp-json\/wp\/v2\/users\/7"}],"replies":[{"embeddable":true,"href":"https:\/\/icloud.pe\/blog\/wp-json\/wp\/v2\/comments?post=17376"}],"version-history":[{"count":1,"href":"https:\/\/icloud.pe\/blog\/wp-json\/wp\/v2\/posts\/17376\/revisions"}],"predecessor-version":[{"id":17377,"href":"https:\/\/icloud.pe\/blog\/wp-json\/wp\/v2\/posts\/17376\/revisions\/17377"}],"wp:attachment":[{"href":"https:\/\/icloud.pe\/blog\/wp-json\/wp\/v2\/media?parent=17376"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/icloud.pe\/blog\/wp-json\/wp\/v2\/categories?post=17376"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/icloud.pe\/blog\/wp-json\/wp\/v2\/tags?post=17376"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}