|
|
|
Web Data Mining: Exploring Hyperlinks, Contents, and Usage Data (Data-Centric Systems and Applications)
Web mining aims to discover useful information and knowledge from the Web hyperlink structure, page contents, and usage data. Although Web mining uses many conventional data mining techniques, it is not purely an application of traditional data mining due to the semistructured and unstructured nature of the Web data and its heterogeneity. It has also developed many of its own algorithms and techniques. Liu has written a comprehensive text on Web data mining. Key topics of structure mining, content mining, and usage mining are covered both in breadth and in depth. His book brings together all the essential concepts and algorithms from related areas such as data mining, machine learning, and text processing to form an authoritative and coherent text. The book offers a rich blend of theory and practice, addressing seminal research ideas, as well as examining the technology from a practical point of view. It is suitable for students, researchers and practitioners interested in Web mining both as a learning text and a reference book. Lecturers can readily use it for classes on data mining, Web mining, and Web search. Additional teaching materials such as lecture slides, datasets, and implemented algorithms are available online. .
Price: $47.96
[ Notify me when price goes down.]
|
|
Spidering Hacks
Written for developers, researchers, technical assistants, librarians, and power users, Spidering Hacks provides expert tips on spidering and scraping methodologies. You'll begin with a crash course in spidering concepts, tools (Perl, LWP, out-of-the-box utilities), and ethics (how to know when you've gone too far: what's acceptable and unacceptable). Next, you'll collect media files and data from databases. Then you'll learn how to interpret and understand the data, repurpose it for use in other applications, and even build authorized interfaces to integrate the data into your own content. By the time you finish Spidering Hacks, you'll be able to: - Aggregate and associate data from disparate locations, then store and manipulate the data as you like
- Gain a competitive edge in business by knowing when competitors' products are on sale, and comparing sales ranks and product placement on e-commerce sites
- Integrate third-party data into your own applications or web sites
- Make your own site easier to scrape and more usable to others
- Keep up-to-date with your favorite comics strips, news stories, stock tips, and more without visiting the site every day
Like the other books in O'Reilly's popular Hacks series, Spidering Hacks brings you 100 industrial-strength tips and tools from the experts to help you master this technology. If you're interested in data retrieval of any type, this book provides a wealth of data for finding a wealth of data..
Price: $7.34
[ Notify me when price goes down.]
|
|
The Webmaster Webmistress Course : How to be a web architect, web developer, site author or website administrator
Every day, thousands of people think about starting their own Webmaster business. Some want to break away from the daily drudgery of working for someone else. Some crave the flexibility of working from home. Some need to supplement their main income. Some feel they are ready to expand their services beyond their circle of contacts. Still others… the list of personal reasons could go and on. But here’s the catch… Every day, most of these people do nothing but dream. The Webmaster BUSINESS Masters Course was written for Webmasters who are ready to stop dreaming and start building a home-based Web site design business. Perhaps you are presently employed full-time/part-time in the field. Or perhaps you design sites for relatives, friends or associates as a favor in your spare time. It doesn’t matter. You already know, based on current and past experiences, that consumer demand for Webmaster services is substantial and that it’s not about to evaporate anytime soon..
Price: $1.59
[Notify me when price goes down.]
|
|
Topic-specific crawling on the Web with the measurements of the relevancy context graph [An article from: Information Systems]
This digital document is a journal article from Information Systems, published by Elsevier in 2006. The article is delivered in HTML format and is available in your Amazon.com Media Library immediately after purchase. You can view it with any web browser. Description: One of the major problems for automatically constructed portals and information discovery systems is how to assign proper order to unvisited web pages. Topic-specific crawlers and information seeking agents should try not to traverse the off-topic areas and concentrate on links that lead to documents of interest. In this paper, we propose an effective approach based on the relevancy context graph to solve this problem. The graph can estimate the distance and the relevancy degree between the retrieved document and the given topic. By calculating the word distributions of the general and topic-specific feature words, our method will preserve the property of the relevancy context graph and reflect it on the word distributions. With the help of topic-specific and general word distribution, our crawler can measure a page's expected relevancy to a given topic and determine the order in which pages should be visited first. Simulations are also performed, and the results show that our method outperforms than the breath-first and the method using only the context graph. .
Price: $10.95
[ Notify me when price goes down.]
|
|
The Semantic Web Research and Applications: 5th European Semantic Web Conference, ESWC 2008, Tenerife, Canary Islands, Spain (Lecture Notes in Computer Science) (Lecture Notes in Computer Science)
This book constitutes the refereed proceedings of the 5th European Semantic Web Conference, ESWC 2008, held in Tenerife, Canary Islands, Spain, in June 2008. The 51 revised full papers presented together with 3 invited talks and 25 system description papers were carefully reviewed and selected from a total of 270 submitted papers. The papers are organized in topical sections on agents, application ontologies, applications, formal languages, foundational issues, learning, ontologies and natural language, ontology alignment, query processing, search, semantic Web services, storage and retrieval of semantic Web data, as well as user interfaces and personalization. .
Price: $127.80
[ Notify me when price goes down.]
|
|
Web Dynamics: Adapting to Change in Content, Size, Topology and Use
The World-Wide-Web is a ubiquitous, global tool, used for finding information, communicating ideas, carrying out distributed computation, and conducting business. The web is highly dynamic in the quantity and nature of the information that it encompasses, which poses a host of challenges. There is a need to understand how the information content and usage of the web change, and to develop techniques for organising and manipulating web information which can handle and exploit its inherent dynamics. Access to the web may be from a variety of devices and interfaces, different users at different locations, and at varying times. There is thus also a need for techniques which dynamically adapt information presentation to the mode of access and to the specific user requirements..
Price: $37.90
[Notify me when price goes down.]
|
|
|
|
|