The publishing industry is experiencing a period of historical change. The advent of the internet has taken many of the main publishing companies by surprise, and in the rush to develop an online strategy, they spent too much time considering how to charge for content while new media companies offering free content were springing up in the marketplace. And so began the erosion of the traditional press market, where the rules of the game started to change, and the profits from online business turned from the hands of the big publishers to someone else.
This is in addition to changes in the advertising sector: recently the amount of investment in online advertising surpassed that of print advertising in many countries (first between all USA and UK). Many people think that one of the main titles will decide to stop printing daily editions—starting a chain reaction that, in the near future, will result in only a few traditional titles still being printed, most likely dedicated to niche markets and not the mainstream news.
There are many worries about the survival of the publishing industry and a lot of debates, the most recent occurring in the heart of Silicon Valley during the week of the most important conference for semantic technology: the Semantic Technology Conference (SemTech). Here, publishing has been one of the main topics at an event involving entrepreneurs, analysts, researchers and investors.
The semantic environment is undoubtedly fascinating, especially the semantic web, of which many consider the turning point for a new era of potential on the internet–an era in which all the available information will be more intelligent, more interconnected and automatically accessible by all, users and consumers alike. It will be a new era, in which new ways to access web sources will allow us to improve not only our online experience, but also our everyday lives. The revolution and the development allowed by semantic technology are the common denominators for the debate about the surviving of publishing.
Some propose to go back in time and sell content to readers. Others, (often with big results) are releasing new online titles with a greater focus on the reader. Still others hope for new laws and legislation to stop the power of some big companies (Google, for example) and be part of the sharing of some of the business opportunities left in the market. There is also the exploitation of new access devices, like the iPad, which allows the possibility to sell content that, until now, was free.
However, in this scenario, technology offers a great opportunity for everyone. During SemTech, these opportunities have been one of the main topics of discussion: an entire section of the conference has been dedicated to the discussion of instances where semantic software applications are used and how these allow content providers to make a greater profit using the same quantity of produced content.
There has been talk about software that automatically proposes similar content to viewers, pulled from the site’s archive or from other online sources. The purpose is to simplify the research of other potentially interesting content, increase the time spent on the site, increase loyalty, and consequently, advertising profits due to more time spent online by readers. There has been a really interesting discussion about the creation of intelligent procedures to improve the user’s experience, allowing better browsing and a higher quality of search results. These mechanisms can allow you to automatically split results in a detailed way (for instance, a reader searching for articles about Japanese cooking could have several similar results: restaurant advisors, recipes etc. for an easier future search).
Another topic has been the development, for publishers and brands, of more profitable ways for online advertising that are less invasive for readers. Thanks to semantic technology, it is possible to create advertising campaigns that link automatically to other content, either to an article or links to feedback submitted by users. This procedure is increasing the possibility to create unique messages that can improve the value for investors without compromising the experience for the reader. Another interesting opportunity offered by semantic technology is the ability for content creators to make their content automatically available in the standard formats required by semantic web. This makes it possible to immediately activate all the described features and to reduce costs so that journalists and bloggers can spend more time on other, more important activities.
In summary, I’m much more convinced that this revolution will bring about historical change, and that conditions exist that will make it possible for big publishers and small innovators to live and compete together in a transparent, fully functioning market. To make this possible I think it will be necessary for big publishers to focus not only on big legal battles against existing contents sources, like free information sites. It is necessary and cheaper for them to understand and take advantage of the opportunities provided by the new technologies (hardware and software). The profit that could be made by applications already existing in the market or just in the minds of some innovators…are potentially much larger than we could imagine.
December 23, 2020: in spite of the catastrophic forecasts of some years ago, the temperature of the earth has only increased half a degree, but in the city, the youngest children have never seen snow. The Smith family is awaiting the delivery of their children’s gifts, ordered as always on Amazon.
In some hours, the family must leave to spend Christmas with the grandparents, but the gifts have still not arrived. Only ten years ago this would have been a complicated situation—made worse by the risk of having to explain the missed arrival of Santa Claus. Today, such a problem is easily resolved thanks to the revolutionary mass adoption of the Semantic Web, which happened some time ago.
Instead of waiting at home for the packages to arrive, Mrs. Smith can modify her current online profile in her personal data locker, a complete archive that guards all personal data, managed directly by the individual. By fostering a collaborative environment, the Semantic Web has dismantled the countless customer archives and systems for the management of shipments, unique for each courier: in 2020, new shipment information catches up with the couriers in real time, and the gifts could instead be delivered to the grandparents’ address. The change of shipping address doesn’t require a telephone call to the customer, nor further modification of shipping procedures.
This small example is just one of the revolutionary scenarios used by this year’s keynote speaker at SemTech, David Siegel, author of “Pull: The Power of the Semantic Web to Transform Your Business,” to explain in practical terms the scale of the changes we can expect from the Semantic Web. According to Siegel’s brave vision, in the new world of the Semantic Web, it will be individuals and consumers who will become the primary point of reference for the market, and therefore it will substantially change each linked aspect of the information.
It will not be necessary to duplicate the data in every database of the organizations and companies in every sector in which we are operating or doing business (for example, think about the personal data necessary for login forms), solely to satisfy the needs of an antiquated, disconnected system. According to Siegel, in the era of the Semantic Web, it will fall to individuals to exactly define the rules of the game, creating a mechanism—a “pull” —so that suppliers of products and services will have the responsibility to retrieve information about its customers (instead of vice versa.) A mechanism that, according to some estimates, will help save thousands of billions of dollars every year.
In the subsequent debate following his presentation, David Siegel was not without critics. There are those who accused him of not estimating the technical aspects of such a model (for example, where will this mysterious personal data locker reside, how will we ensure it is securely managed?, etc.)
Personally, I found Siegel’s book and presentation very interesting. If we really want the new era of the Web to arise, it is necessary to initially explain the concrete advantages for consumers. Surely not everything will be developed as Siegel predicts, but his theory has merit, because it makes the motives and utilization of the Semantic Web more clear.
In the second keynote of the day, David Recordon from Facebook introduced the “Open Graph protocol” initiative. Released by the company some weeks ago, this initiative has opened an important view of what our online future could be. Through this new protocol, created by Facebook but available to all of its users, every web page will have the possibility to integrate itself in the social graph of Facebook through a simple copy and paste of code. This opens the door, for example, to the development of applications that automatically take into account each user’s social network.
For example, think about the personalization of search rankings for a classic site dedicated to restaurants or movies, according to the preferences of our friends who have already had eaten in the restaurant or who have already seen the movie. Although the first applications of this protocol already exist, the real revolution will happen in the coming years, and we can bet that new applications will know how to fully take advantage of this innovation.
Now that SemTech has drawn to a close, to give a combined vision of the current state of the industry, as well as other perspectives on the sector, I want to briefly point to the forecasts and data that emerged from the panel dedicated to investments and acquisitions in the field. In the last 12 months, we have seen a significant acceleration of activity. We have seen acquisitions of semantic technology companies from larger organizations or from new companies, plus a significant and increasing number of investments in start ups, especially in the areas of sentiment analysis, semantic publishing and semantic advertising. Therefore, it is clear that the field enjoys optimal health and we will continue to have interesting developments in the near future (stay tuned).
All in all, it has been another positive event. A larger participation of big companies like Procter & Gamble, Chevron, Lockheed Martin, CNN, etc, have given greater validity to the conference, confirming the business value of the semantic technology. The attention of some big names, from Facebook to Google, Apple to The New York Times, strengthens the vision of a Web and its consumer applications that will be diverse and innovative, even if not completely defined. If the recent increase in media interest is any indication (demonstrated by a significant increase in the number of articles dedicated to semantic technology in recent months), we can believe that the best is yet to come, and I really hope that I will be talking about it in the 2011 edition.
I think it’s really interesting to think about some of the numbers pointed out in this article. To know that 34% of US adults use social media to comment and share their opinions about companies, brands and products is really surprising: if they ever asked me to estimate this percentage, I would have never guessed more than 10% (and probably less). If they were talking about young people or teenagers it would be normal, but the survey says that this is diffused also for users between 34 and 54 years old.
It also surprised me that so many are posting positive comments. I would have expected that many people would use social media as a tool to vent their dissatisfactions and complaints about products and services that are not providing the expected result. Obviously the social side of internet is also bringing out people’s good intentions (not just the bad :-), so it becomes a pleasure to recommend something we liked to others.
For those of us working in technology and information management, these numbers are really important. They demonstrate how necessary it is for companies to analyze the social content of the internet to know what users and consumers think about their products or services. To have access to this information is really easy (they are free and usually open source), but being able to process them in an efficient and useful way is extremely difficult: this is a perfect task for semantic technology–it cannot make miracles, but it allows you to target relevant information in a more efficient way than other technologies.
The idol (and surprise) of the first day of the Semantic Technology Conference in San Francisco has a very different profile than what you may expect, at least at an event that brings together some of the most brilliant minds from one of the more innovative sectors in the world of software. His name is Landon Donovan, a professional soccer player, and he was the only one who has been able to successfully unite all of the participants, after just 90 minutes, by making the goal that qualified the United States for the next stage of the World Cup. This unforeseen fortune has thus transformed the hundreds of geeks that fill the hall of the Hilton before the start of the conference, into a crowd of excited fans in a stadium.
Apart from Donovan, the first day of SemTech was surely in line with expectations. As the organizers promised, the presentations have been concentrated on the business rather than the purely technological aspects. Compared to last year’s conference, the topics have been more about case studies, ROI and the real needs of customers.
Among the presentations of this full program, I found of particular interest the series dedicated to the application of semantic technology to marketing. In a market of consumers who are increasingly influenced by opinions expressed online (websites, blogs or forums), by now it has become strategic for all businesses to institute mechanisms for real-time monitoring. By analyzing these sources, it is possible to precisely identify opinions, sentiment and new trends that can shift the competitive landscape.
Considering the immense quantities of available data and information to analyze daily, an effective solution requires a system that can process content automatically and reliably. Semantic technology can help, thanks to its capacity to understand the significance of words and their relation to expressed concepts.
The panel, including some of the main vendors in this very crowded sector, has rightly discussed the problems of complexity in fully realizing a semantic system and the often exaggerated expectations by companies new to the sector. The discussion of the necessity to integrate raw data (for example, that it measures sentiment regarding product features, extracted automatically), as well as analysis made by specialized consultants, has been particularly enlightening. Right now, the match is tied on this particular topic and we will probably still see some adjustments before these listening systems will be fully implemented in the marketplace.
A second very interesting presentation was that of SalesForce (www.salesforce.com). The company presented a semantic system that, through analysis of content added by employees, can create maps that summarize in real time those employees’ abilities and interests. This knowledge map is used to allow everyone who is looking for a particular set of skills to be able to query the system and easily retrieve the profiles of employees best suited to manage a particular data problem. This type of instrument can be very useful for companies with employees in diverse geographical locations, but also in particular situations that require fast access to unique or less common skills, as well as other diverse skills and interests that are not directly related to an employee’s current responsibilities.
The final note for the day is dedicated to an interesting online application (www.tripit.com) that provides a very useful tool for travellers—one that allows you to easily create a travel itinerary with all the useful information you need, as well as an electronic copy for your trip. For a simpler, more transparent user experience, Tripit uses semantic technology to personalize relevant information inside an email to confirm a travel reservation (for example, dates, location, name of hotel, etc.). The details automatically load into the email, including logistical information like the weather forecast, maps of the area, etc. And finally, the itineraries are available via the web or mobile (which is especially useful when travelling), allowing the traveller to avoid having to keep track of paper copies of reservations or confirmation numbers.
This year’s edition of the Semantic Technology conference has the clear objective to prove, to a still partially skeptical market, that semantic technologies and the broader semantic web are for real, and that they can significantly contribute to create business value to organizations in many different industries. Just by looking at the comprehensive agenda, organized very smartly in tracks covering specific topics and areas of applications, I was pleased to realize that speeches and panels, instead of focusing on standard and technical aspects like in past years, will cover concrete business applications that are easy to understand, even for people who have never written a line of code.
Considering these points, and last but not least, the fact that the event is in San Francisco (apologies to San Jose…), I am really looking forward to the conference to start, also because the event represents a unique opportunity to meet entrepreneurs, analysts and investors to understand whether their mood and vision on the future of the sector has changed after the difficult recession of the last 18 months. Among the long list of presentations, I am particularly interested in the following:
Personally, I will make a speech on Wednesday at 10.15 inside a panel dedicated to the application of semantic technologies to publishing. I find fascinating, and at the same time particularly confusing, the debate in this sector. On one side, you have the traditional players that are under a great deal of pressure to try to retake a leadership position, and financial profitability, after having allowed Google to almost destroy their business model and weaken significantly their competitive position. On the other side, you have the new online only players, like the Huffington Post, that are forced by their somehow unexpected success to continue to innovate to ensure they can continue to offer a unique a difference experience to their readers. Both sides see semantic technologies as strategic because these technologies, and the applications deriving from them, can help them to increase revenue by improving the user experience and providing a more effective way to serve advertising, and to reduce costs by automating the work of content creators in order to let them focus on the most valuable part of their job (creating content) instead of wasting time in low value activities like manually tagging the content to make it easier to search and access by the user.
If you are in San Francisco this week, don’t forget to visit Expert System’s booth (#207) or to follow us on this blog or on twitter @scagliarini and @brookeaker.
Last week I organized “Semantic Web Makes for Business Intelligence 2.0”, a meetup event for the CT Semantic Web Meetup Group (Ah yes, I funded the Connecticut Semantic Web Group last April. The reason was simple: we had the NYC and Boston version of these groups but didn’t have one of them in CT. So how about having some meetups right here in CT to learn, trade stories and promote the Semantic Web / Web 3.0 right here?). And this week I will take part in an other Semantic Web event which seems very interesting and was scheduled as part of Internet Week NY 2010: “Building NLP Semantic Web Applications for Financial Services“.
You’ll find all the details here. But what’s the point? This is sea-change technology and early adopters are in corporate communities. The technology is here now and coming on fast. Come to these meetups so you and your company are out in front and not left behind.
In the film “Dead Poets Society”, Professor John Keating asks his students to rip out the introduction to their poetry book which describes how to rate the quality of poetry using a Cartesian system of coordinates – the system is a kind of “mathematical” evaluation based on “measurable” criteria. While we can agree that a “mechanical” approach to poetry evaluation is most likely nonsensical, we asked ourselves if this kind of approach (along with the use of an automatic system) could actually be used to examine text which has nothing to do with poetry, such as a political speech.
The political debates currently being held in the UK are the perfect occasion to satisfy our curiosity. The question is, can this situation can be considered sufficiently “scientific”, in that the three candidates all have to respond to the same questions during a live conference with a time limit for each response?
My colleague, Marco Giorgini and I have spent some time creating a small program which uses our semantic software COGITO to analyze the debates and extract the elements which appear to be significant. We had some fun as we examined our results and found out what we could understand about a political campaign if we didn’t bother to look at each candidate, their facial expressions, their tone of voice, or the cadence of the discussions. We just limited ourselves to “chopping up” what was said and evaluating the most abstract essence of the concepts discussed, the use of the lexicon and the grammatical structures.
Obviously, we just played around with the language, but we were also able to make some interesting, if not curious, considerations. This report provides the details as to what we found.
A special thank-you to Marco Giorgini who worked directly on the development of the report as well as the layout of this post.
Italy seems to be a popular subject in the daily buzz published in the press (online and offline), in blogs and in other forms of social networking. The interest in this small country is certainly greater than it’s political, geographic or strategic role.
There are an infinite amount of declarations of love regarding: food, landscapes, history, “dolce vita”, Ferrari and beauty (especially in regards to the players on the national soccer team). These are counterbalanced by strong negative opinions concerning: disorganization, corruption, politics and the Italian stereotype of mammismo (aka ‘momism’). It seems like the world tends to be polarized in judging Italy using subjective emotional criteria instead of the same objective and modern criteria which is usually reserved for the leading nations of the 21st century.
The “Italy of Innovators” project (P.S. Expert System is one of the selected companies) is actually an excellent example of how Italy is also a modern country which pays attention to technology and is full of innovative companies who struggle to put their ideas on the market despite the financial, academic and political systems which are a far cry from the Silicon Valley.
So here’s my suggestion to all of the international investors, entrepreneurs and university professors who follow my blog (OK, perhaps I’ve exaggerated a bit ): set down your glass of Chianti, stop listening to Turandot, start reading the list of these companies and keep these innovators in mind during your next vacation to the boot-shaped country. Take a chance to set up a meeting with these entrepreneurs…it might just lead to new investment opportunities!
Note: for those would like read more about Italy, I suggest books by Beppe Severgnini who has also inspired some of the opinions I expressed in this post.
Many technology companies have managed to flood the market with their “magic” automatic solutions. They promise precision tools which are simple to use, have enticingly colorful dashboards and provide full coverage of “consumers sentiment” with minimal effort. Many of their web sites state that once top executives are equipped with their magic tool, the task of reading and interpreting the extracted data can be easily delegated to their subordinates.
As I am directly involved with this aspect with my company, I can’t seem to figure out if this claim is just an aggressive marketing campaign or if it is just a way to tell customers what they want to hear. Either way, it has created a situation where the only dialogue with the potential customers is based on looking at the data contained these reports and trying to find the errors in the extraction, thus casting doubts on whether the technology is actually ready for the market.
This vicious cycle needs to be stopped. It could be useful to remember the following guidelines to create a more productive climate and provide value both to businesses and to technology companies:
1) Online sentiment analysis is just another element of competitive intelligence and should be handled accordingly
2) Even the more established technologies (ERP, BI, CRM) are not perfect and do have a margin of acceptable error
3) A group of analysts should be employed to examine the data, extract the knowledge and scan the sources
4) Set priorities and be conservative: avoid incidents first
5) Learn how messages propagate
The current economic crisis is creating new possibilities for semantic technology. Businesses need to cut costs (which is usually done by downsizing personnel) without making any cutbacks in the quality of their services. No one can afford to let quality go downhill, because once it does, the competition is more than ready to pounce on the opportunity.
The fact that there is growing interest in semantic solutions (like automatic categorization and text mining) is obviously a positive factor (for us, at least :-), but the crisis has also complicated the situation. Companies express interest in implementing innovative solutions, however their present-day budgets are quite limited, especially when it comes to purchasing software licenses. This creates a kind of catch-22 situation because investing in innovative technology is often the key to improving efficiency within a company (when correctly implemented.)
It is very important to thoroughly weigh the pros and cons of this situation. Cost may seem to be a problem at first (because it may exceed the allotted budget funds), but in reality it is actually quickly absorbed thanks to the level quality which is gained (quality which was not compromised by the need to reduce costs). This type of approach often makes it possible to find a solution which benefits both sides (customer and supplier) and guarantees an outcome which lives up to expectations. Of course, flexibility is required on both ends and supplier experience is essential to minimize implementation risks.
If a budget is truly minimal, it is usually best not to play around with money. Money should never be wasted, especially if it’s scarce. Although sometimes it is actually possible to reduce the final objectives and focus only on the fundamental aspects and still be able to obtain positive results.
Sooner or later this crisis is bound to end! Those who invested wisely during these hard times will have an advantage over those who chose not to take any risks.