Semantic Web Final
Semantic Web Final
Semantic Web
The Semantic Web is a "man-made woven web of data" that facilitates machines to understand the semantics, or meaning, of information on the World Wide Web.It extends the network of hyperlinked human-readable web pages by inserting machine-readable metadata about pages and how they are related to each other, enabling us to access the Web more intelligently and perform tasks on behalf of users. The term was coined by Tim Berners-Lee, the inventor of the World Wide Web and director of the World Wide Web Consortium ("W3C"), which oversees the development of proposed Semantic Web standards. He defines the Semantic Web as "a web of data that can be processed directly and indirectly by machines." The Semantic Web is a web that is able to describe things in a way that computers can understand. The word semantic stands for the meaning of: The Semantic Web = a Web with a meaning ie. for the computer. In the Semantic Web data itself becomes part of the Web and is able to be processed independently of application, platform, or domain. This is in contrast to the World Wide Web which contains virtually boundless information in the form of documents. We can use computers to search for these documents, but they still have to be read and interpreted by humans before any useful information can be used. Computers can present you with information but cant understand what the information is to display the data that is most relevant in a given circumstance. The Semantic Web is about having data as well as documents on the Web so that machines can process, transform, assemble, and even act on the data in useful ways.
CSPIT(IT) 1
10IT102
Semantic Web
2. Introduction
Sentences can be understood by people. But how can they be understood by computers? Statements are built with syntax rules. The syntax of a language defines the rules for building the language statements. But how can syntax become semantic? This is what the Semantic Web is all about ie.describing things in a way that computers applications can understand it.
The Semantic Web is an information space used by machines rather thanhumans. Instead of processing and manipulating Web information, a userwould have a personal agent on his/her computer that would solve problemsrelated to information overload, acquisition and discrepancy resolution.Once an agent has executed the starting level of information management, a userwould access or manipulate the results. In order to execute these tasks, theinformation the agents uses has to be presented in an increasing semantically enriched format by means of several technology layers.In both these versions of the Semantic Web architecture, a higher level layer language use the syntax and semantics of its immediate lower level layer. The Semantic Web is a mesh of information linked up in such a way as to be easily processable by machines, on a global scale. You can think of it as being an efficient way of representing data on the World Wide Web, or as a globally linked database. The increasing usage of the current World Wide Web leads to a new challenge of optimizingthe interchange of information, due to the fact that a huge amount of data is interpretable by humans only. Currently the focus of a W3C working group, the Semantic Web vision was conceived by Tim Berners-Lee, the inventor of the World Wide Web. Calling it the next step in Web evolution, Berners-Lee defines the Semantic Web as a web of data that can be processed directly and indirectly by machines.
Semantic Web technologies helpseparate meanings from data,document content, or application code,using technologies based on openstandards. You can think of the Semantic Web asan efficient way to represent data onthe World Wide Web, or as a databasethat is globally linked, in a mannerunderstandable by machines, to thecontent of documents on the WebPage.
CSPIT(IT)
10IT102
Semantic Web
3. A Vision of Possibilities
Semantic web on the cusp of something big.The Semantic Web is built up by techniques such as XML, RDF, ontologies and logic. The content of the Semantic Web is represented by ontologies and metadata. Further the definitions will be extended by axioms,algorithms. Trust and proof can be applied by the use of digital signatures. Figure of the 1: The layer structure Semantic Web.
10IT102
Semantic Web
The Semantic Web is regarded as an integrator across different content, information applications and systems. Rapid growth in the volume of data on the web doesnt always
CSPIT(IT) 4
10IT102
Semantic Web
provide the right and exact data.The Semantic Web landscape is charted and a brief summary of related terms and enabling technologies is presented.
CSPIT(IT)
10IT102
Semantic Web
RDF was developed by people with academic background in logic and artificial intelligence. For traditional developers it is not very easy to understand.One fast growing language for building semantic web applications is RSS. The Semantic Web agent does not include artificial intelligence rather, it relies on structured sets of information and inference rules that allow it to understand the relationship between different data resources. The computer doesnt really understand information the way a human can, but it has enough information to make logical connections and decisions.
CSPIT(IT)
10IT102
Semantic Web
4.2 Need
The idea of a semantic web, able to describe and associate meaning with data necessarily involves more than simple HTML mark-up code. It is based on an assumption that in order for it to be possible for machines to accurately interpret web content, far more than the mere (natural language) ordered relationships involving letters and words is necessary - as underlying infrastructure attendant to semantic issues. If information about music, cars, tickets, etc. were stored in RDF files, intelligent web applications could collect information from many different sources, combine information, and present it to users in a meaningful way. Information like this:
Car prices Information about movies Information about books (price, pages, editor, year) Dates of events Computer updates
The semantic web augments the current web with formalised knowledge and data that can beprocessed by computers. Some services will mix human readable and structured data so thatthey can be used by both humans and computers. Others will support only formalized knowledge and will only be used by machines. This will enable: computers to assist human users in tasks; the computers can understand the data inways they cannot today, the creation of a more open market in information processing and computer servicesenabling the creation of new applications and services from combinations of existingservices. It will be beneficial for the society as a whole: for the economy because it will allowcompanies to better interoperate and to quickly find the best opportunities. It will benefitcitizens because it will support them in their day-to-day work, leisure and interaction withorganisation and because it will help them to enforce the degree of control they want overtheir personal data, preferences, etc.). Nevertheless, for the semantic web to happen, it is not just a matter of technology. It involvestechnology, economics, and social matters. A trade-off must be found between these domainsthat could lead to a value adding, appealing, and easy to use semantic web.
CSPIT(IT)
10IT102
Semantic Web
4.3 Challenges
The goal of the semantic web is to be a web talking to machines, i.e. in which machines canprovide a better help to people because they can take advantage of the content of the Web. Theinformation on the web should thus be expressed in a meaningful way accessible to computers.The key requirement for the semantic web is interoperability. If machines have to takeadvantage of web resources, they must be able to access them and use them. They must be given some freedom and be able to publishe information in a human or tool-oriented format: theymust be invokable and published in an open, structured and rich format that let the machinesmake the best out of them. One of the challenges of the current semantic web developments is the design of a frameworkin which all these understanding can collaborate, because the full benefit of the semantic webcan only be attained when computers relate resources from various sources. One of the greatest challenges for the Semantic Web is achieving web-scale. While the information retrieval community has developed successful strategies for coping with the scale of the web using statistical techniques, semantic web technologies are still struggling with scaling up to the web as such. This is in part due to the need to preserve the datas structure and the need to perform various forms of reasoning in order to more effectively leverage the available information.To handle vast amount of data. Some of the challenges for the Semantic Web include vastness, vagueness, uncertainty, inconsistency, and deceit. Automated reasoning systems will have to deal with all of these issues in order to deliver on the promise of the Semantic Web. Vastness: The World Wide Web contains at least 24 billion pages as of this writing (June 13, 2010). The SNOMED CT medical terminology ontology contains 370,000 class names, and existing technology has not yet been able to eliminate all semantically duplicated terms. Any automated reasoning system will have to deal with truly huge inputs. Vagueness: These are imprecise concepts like "young" or "tall". This arises from the vagueness of user queries, of concepts represented by content providers, of matching query terms to provider terms and of trying to combine different knowledge bases with overlapping but subtly different concepts. Fuzzy logic is the most common technique for dealing with vagueness. Uncertainty: These are precise concepts with uncertain values. For example, a patient might present a set of symptoms which correspond to a number of different distinct diagnoses each with a different probability. Probabilistic reasoning techniques are generally employed to address uncertainty.
CSPIT(IT) 8
10IT102
Semantic Web
Inconsistency: These are logical contradictions which will inevitably arise during the development of large ontologies, and when ontologies from separate sources are combined. Deductive reasoning fails catastrophically when faced with inconsistency, because "anything follows from a contradiction". Defeasible reasoning and paraconsistent reasoning are two techniques which can be employed to deal with inconsistency. Deceit: This is when the producer of the information is intentionally misleading the consumer of the information. Cryptography techniques are currently utilized to alleviate this threat. Eg. E-commerce, data searching.
CSPIT(IT)
10IT102
Semantic Web
I Sell Application (ISA) When you submitted the form, the application would ask you for more information and store your ID and the information in an RDF file made available to the web. The RDF file would contain information like: Your ID: Name, address, email, ID number. Your selling item: type, model, picture, price, description. Behind the scenes, the "ISA" application creates an RDF file with a lot of RDF pointers. It creates an RDF pointer to a file with information about you. An RDF pointer is a pointer (actually an URL) to information about things (like a knowledge database). The beauty about this is that you don't have to describe yourself, or the car model. The RDF application will sort it out for you. RDF is data about data - or metadata. Often RDF files describe other RDF files. Will it ever be possible to link all these RDF files together and build a semantic web? I dont think the semantic web will work all by itself. It will need some help to become a reality. It is not very likely that you will be able to sell your car just by putting your RDF file on the Internet. One day you will be able to collect information about almost everything on the web in a standardized RDF format.It might not be free.Searching information about things on the Internet will be much easier than before.
CSPIT(IT)
10
10IT102
Semantic Web
If computers become more adept at pulling together data from different sources?In the movie Terminator, humanity slid down the path to destruction when a supercomputer called Skynet started to become smarter on its own. I was reminded of that possibility during my report on the semantic Web. The most general definition is that the semantic Web speaks to how we are moving from a Web of documents to a Web of linked data. You might think that Google algorithms do a fine job of finding information and organizing the Web. These semantic Web visionaries would like humans to work with machines to make the data we create more easily accessible and analyzed. This is the fundamental difference between linked documents (what Google does) and linked data. Tim Berners-Lee, who invented the World Wide Web, says the semantic Web will give information a well-defined meaning, better enabling computers and people to work in cooperation. If machines can look at all documents and pull out the who, what, when and where (and someday how and why) so other machines can understand them in a standardized way, then all sorts of interesting opportunities arise for how that information can be found and used. Names and places and ideas and even emotions expressed in stories become much more than just words in one story; they become the way that all of the information in many documents can be linked and layered together to create new documents and stories. The goal is to generate better answers using organized data.
10IT102
Semantic Web
The Web is messy and filled with contradictory and unclear information that still needs human interpretation. Semantic Web advocates know the Internet will remain confusing for machines because we humans keep changing it in unexpected ways, not to mention our habit of inventing new words and meaning.
CSPIT(IT)
12
10IT102
Semantic Web
CONCLUSION
There exist slightly differing application and usage patterns of Social Software between the two groups, but the general trend says: Wikis are king! Social Bookmarking stays behind. There exists broad consent about the benefits of Social Software. Both groups say that quick access to information and knowledge is the biggest benefit generated by Social Software, followed by social networking functionalities and ubiquitous access to documents and data. Search is the killer app! Integration costs & data control might be important aspects. The expected time to market is 2 5 years. The Semantic Web lets you do things fast. And because you can do things fast, you can do lots more things than you could before. The word 'fast" can be a bit deceptive when talking about technology. We can all be a bit obsessed with what I call stopwatch time. Stopwatch time is speed measured in seconds (or less). It's raw performance: How much quicker does my laptop boot up with an SSD? How long does it take to load 100 million records into a database? How many queries per second does your SPARQL implementation do on the Berlin benchmark with and without a recent round of optimizations? If my relational database application renders* a sales forecast report in 500 milliseconds while my Semantic Web application takes 5 seconds, you might hear people say that the relational approach is 10 times faster than the Semantic Web approach. But if it took six months to design and build the relational solution versus two weeks for the Semantic Web solution, Semantic Sam will be adjusting his supply chain and improving his efficiencies long before Relational Randy has even seen his first report.
CSPIT(IT)
13
10IT102
Semantic Web
REFERENCES
CSPIT(IT)
14