Coverart for item
The Resource Apache Solr Beginner's Guide

Apache Solr Beginner's Guide

Label
Apache Solr Beginner's Guide
Title
Apache Solr Beginner's Guide
Creator
Subject
Language
eng
Summary
Written in a friendly, example-driven format, the book includes plenty of step-by-step instructions and examples that are designed to help you get started with Apache Solr. This book is an entry level text into the wonderful world of Apache Solr. The book will center around a couple of simple projects such as setting up Solr and all the stuff that comes with customizing the Solr schema and configuration. This book is for developers looking to start using Apache Solr who are stuck or intimidated by the difficulty of setting it up and using it. For anyone wanting to embed a search engine in their
http://library.link/vocab/creatorName
Serafini, Alfredo
Dewey number
  • 025.5
  • 025.524
Index
no index present
Language note
English
Literary form
non fiction
Nature of contents
dictionaries
http://library.link/vocab/subjectName
  • Open source software
  • Search engines
  • Web search engines
Label
Apache Solr Beginner's Guide
Instantiates
Publication
Contents
  • Cover; Copyright; Credits; About the Author; Acknowledgments; About the Reviewers; www.PacktPub.com; Table of Contents; Preface; Chapter 1: Getting Ready with the Essentials; Understanding Solr; Learning the powerful aspects of Solr; Working with Java installation; Downloading and installing Java; Configuring CLASSPATH and PATH variables for Java; Installing and testing Solr; Time for action -- starting Solr for the first time; Taking a glance at the Solr interface; Time for action -- posting some example data; Time for action -- testing Solr with cURL; Who uses Solr?; Resources on Solr
  • How will we use Solr?Summary; Chapter 2 Indexing with Local PDF Files; Understanding and using an index; Posting example documents to the first Solr core; Analyzing the elements we need in Solr core; Time for action -- configuring Solr Home and Solr core discovery; Knowing the legacy solr.xml format; Time for action -- writing a simple solrconfig.xml file; Time for action -- writing a simple schema.xml file; Time for action -- starting the new core; Time for action -- defining an example document; Time for action -- indexing an example document with cURL; Executing the first search on the new core
  • Adding documents to the index from the web UITime for action -- updating an existing document; Time for action -- cleaning an index; Creating an index prototype from PDF files; Time for action -- defining the schema.xml file with only dynamic fields and tokenization; Time for action -- writing a simple solrconfig.xml file with an update handler; Testing the PDF file core with dummy data and an example query; Defining a new tokenized field for fulltext; Time for action -- using Tika and cURL to extract text from PDFs; Using cURL to index some PDF data
  • Time for action -- finding copies of the same files with deduplicationTime for action -- looking inside an index with SimpleTextCodec; Understanding the structure of an inverted index; Understanding how optimization affects the segments of an index; Writing the full configuration for our PDF index example; Writing the solrconfig.xml file; Writing the schema.xml file; Summarizing some easy recipes for the maintenance of an index; Summary; Chapter 3: Indexing Example Data from DBPedia -- Paintings; Harvesting paintings' data from DBPedia; Analyzing the entities that we want to index
  • Analyzing the first entity -- PaintingWriting Solr core configurations for the first tests; Time for action -- defining the basic solrconfig.xml file; Looking at the differences between commits and soft commits; Time for action -- defining the simple schema.xml file; Introducing analyzers, tokenizers, and filters; Thinking fields for atomic updates; Indexing a test entity with JSON; Understanding the update chain; Using the atomic update; Understanding how optimistic concurrency works; Time for action -- listing all the fields with the CSV output; Defining a new Solr core for our Painting entity
  • Time for action -- refactoring the schema.xml file for the paintings core by introducing tokenization and stop words
Control code
ocn867316995
Dimensions
unknown
Extent
1 online resource (324 pages)
Form of item
online
Isbn
9781782162537
Specific material designation
remote
System control number
(OCoLC)867316995
Label
Apache Solr Beginner's Guide
Publication
Contents
  • Cover; Copyright; Credits; About the Author; Acknowledgments; About the Reviewers; www.PacktPub.com; Table of Contents; Preface; Chapter 1: Getting Ready with the Essentials; Understanding Solr; Learning the powerful aspects of Solr; Working with Java installation; Downloading and installing Java; Configuring CLASSPATH and PATH variables for Java; Installing and testing Solr; Time for action -- starting Solr for the first time; Taking a glance at the Solr interface; Time for action -- posting some example data; Time for action -- testing Solr with cURL; Who uses Solr?; Resources on Solr
  • How will we use Solr?Summary; Chapter 2 Indexing with Local PDF Files; Understanding and using an index; Posting example documents to the first Solr core; Analyzing the elements we need in Solr core; Time for action -- configuring Solr Home and Solr core discovery; Knowing the legacy solr.xml format; Time for action -- writing a simple solrconfig.xml file; Time for action -- writing a simple schema.xml file; Time for action -- starting the new core; Time for action -- defining an example document; Time for action -- indexing an example document with cURL; Executing the first search on the new core
  • Adding documents to the index from the web UITime for action -- updating an existing document; Time for action -- cleaning an index; Creating an index prototype from PDF files; Time for action -- defining the schema.xml file with only dynamic fields and tokenization; Time for action -- writing a simple solrconfig.xml file with an update handler; Testing the PDF file core with dummy data and an example query; Defining a new tokenized field for fulltext; Time for action -- using Tika and cURL to extract text from PDFs; Using cURL to index some PDF data
  • Time for action -- finding copies of the same files with deduplicationTime for action -- looking inside an index with SimpleTextCodec; Understanding the structure of an inverted index; Understanding how optimization affects the segments of an index; Writing the full configuration for our PDF index example; Writing the solrconfig.xml file; Writing the schema.xml file; Summarizing some easy recipes for the maintenance of an index; Summary; Chapter 3: Indexing Example Data from DBPedia -- Paintings; Harvesting paintings' data from DBPedia; Analyzing the entities that we want to index
  • Analyzing the first entity -- PaintingWriting Solr core configurations for the first tests; Time for action -- defining the basic solrconfig.xml file; Looking at the differences between commits and soft commits; Time for action -- defining the simple schema.xml file; Introducing analyzers, tokenizers, and filters; Thinking fields for atomic updates; Indexing a test entity with JSON; Understanding the update chain; Using the atomic update; Understanding how optimistic concurrency works; Time for action -- listing all the fields with the CSV output; Defining a new Solr core for our Painting entity
  • Time for action -- refactoring the schema.xml file for the paintings core by introducing tokenization and stop words
Control code
ocn867316995
Dimensions
unknown
Extent
1 online resource (324 pages)
Form of item
online
Isbn
9781782162537
Specific material designation
remote
System control number
(OCoLC)867316995

Library Locations

    • InternetBorrow it
      Albany, Auckland, 0632, NZ
Processing Feedback ...