How we did it:
For any feedback, any questions, any notes or just for chat - feel free to follow us on social networks
Ian H. Witten, Alistair Moffat, Timothy C. Bell
In this fully updated second edition of the highly acclaimed Managing Gigabytes, authors Witten, Moffat, and Bell continue to provide unparalleled coverage of state-of-the-art techniques for compressing and indexing data. Whatever your field, if you work with large quantities of information, this book is essential reading--an authoritative theoretical resource and a practical guide to meeting the toughest storage and access challenges. It covers the latest developments in compression and indexing and their application on the Web and in digital libraries. It also details dozens of powerful techniques supported by mg, the authors' own system for compressing, storing, and retrieving text, images, and textual images. mg's source code is freely available on the Web. * Up-to-date coverage of new text compression algorithms such as block sorting, approximate arithmetic coding, and fat Huffman coding * New sections on content-based index compression and distributed querying, with 2 new data structures for fast indexing * New coverage of image coding, including descriptions of de facto standards in use on the Web (GIF and PNG), information on CALIC, the new proposed JPEG Lossless standard, and JBIG2 * New information on the Internet and WWW, digital libraries, web search engines, and agent-based retrieval * Accompanied by a public domain system called MG which is a fully worked-out operational example of the advanced techniques developed and explained in the book * New appendix on an existing digital library system that uses the MG software
Michael McCandless, Erik Hatcher, Otis Gospodnetić
Lucene remains an indispensable part of most enterprise applications. This search engine now powers Web options in diverse companies, including Netflix, LinkedIn, and the Mayo Clinic. This updated edition is the definitive guide to developing with Lucene.
Serge Linckels, Christoph Meinel
This book introduces a new approach to designing E-Librarian Services. With the help of this system, users will be able to retrieve multimedia resources from digital libraries more efficiently than they would by browsing through an index or by using a simple keyword search. E-Librarian Services combine recent advances in multimedia information retrieval with aspects of human-machine interfaces, such as the ability to ask questions in natural language; they simulate a human librarian by finding and delivering the most relevant documents that offer users potential answers to their queries. The premise is that more pertinent results can be retrieved if the search engine understands the meaning of the query; the returned results are therefore logical consequences of an inference rather than of keyword matches. Moreover, E-Librarian Services always provide users with a solution, even in situations where they are unable to offer a comprehensive answer.
Eric Enge, Stephan Spencer, Rand Fishkin, Jessie Stricchiola, John Battelle
"Four of the most noted experts in the field of search engine optimization (SEO) provide you with proven guidelines and cutting-edge techniques for planning and executing a comprehensive SEO strategy. In this book, you will explore the underlying theory behind SEO and how search engines work, learn the steps you need to prepare for, execute, and evaluate SEO initiatives, examine a number of advanced strategies and tactics, understand the intricacies involved in managing complex SEO projects, learn what'snecessary to build a competent SEO team with defined roles and glimpse the future of search and what lies ahead for the SEO industry."--Publisher's description.