We have SharePoint 2013 May CU 17 installed.
The search crawls logs are reporting numerous errors on specific files related to the "The content processing pipeline failed to process the item" error. On further investigation it seems the error is related to the TermExtractor. It seems there is no available documentation on this specific error.
Any help in this regard is greatly appreciated.
Coder81
Crawl Log Error (example)
The content processing pipeline failed to process the item. ( Object reference not set to an instance of an object.; ; SearchID = 6D67FFC0-8961-4FE4-8141-2D7388FD6917 )
ULS Error (example)
[Microsoft.CrawlerFlow-d14fe52f-334b-4c17-ae29-376749518eef] Microsoft.Ceres.Evaluation.Engine.ErrorHandling.HandleExceptionHelper : Evaluation failure detected: Operator : TermExtractorSub Operator type : TermExtractor Error id : 0 Correlation id : 861fb29c-2b5b-46ed-a670-f218d9e135fc Partition id : 0c37852b-34d0-418e-91c6-2ac25af4be5b Message : Object reference not set to an instance of an object. 49691C90-7E17-101A-A91C-08002B2ECDA9:#9: http://somexample.com/examplepath/thumbn id : ssic://394888 System.NullReferenceException: Object reference not set to an instance of an object.
at Microsoft.Ceres.ContentEngine.NlpEvaluators.TermExtractorEvaluator.TermExtractorProducer.ProcessRecord(IRecord r)
at Microsoft.Ceres.Evaluation.Processing.Producers.ProducerHelper.ThreadPoolTask(Object state)
at Microsoft.Ceres.Evaluation.Processing.Producers.ProducerHelper.ProcessUsingThreadPool(IRecord record, OperatorBase op, String logPrefix, IRecordProducer producer, IEvaluationContext context)
at Microsoft.Ceres.Evaluation.Processing.Producers.ProducerRecordSetSink.Put(IRecord record)
at Microsoft.Ceres.Evaluation.Engine.ErrorHandling.HandleExceptionRecordSetSink.DoWithoutTryCatch(IRecord record)
at Microsoft.Ceres.Evaluation.Processing.Producers.ProducerRecordSetSink.PushToOutput()
at Microsoft.Ceres.Evaluation.Processing.Producers.ProducerRecordSetSink.Put(IRecord record)
at Microsoft.Ceres.Evaluation.Engine.ErrorHandling.HandleExceptionRecordSetSink.DoWithoutTryCatch(IRecord record)
at Microsoft.Ceres.Evaluation.Processing.Producers.ProducerRecordSetSink.PushToOutput()
at Microsoft.Ceres.Evaluation.Processing.Producers.ProducerRecordSetSink.Put(IRecord record)
at Microsoft.Ceres.Evaluation.Engine.ErrorHandling.HandleExceptionRecordSetSink.DoWithoutTryCatch(IRecord record)
at Microsoft.Ceres.Evaluation.Processing.Producers.ProducerRecordSetSink.PushToOutput()
at Microsoft.Ceres.Evaluation.Processing.Producers.ProducerRecordSetSink.Put(IRecord record)
at Microsoft.Ceres.Evaluation.Engine.ErrorHandling.HandleExceptionRecordSetSink.DoWithoutTryCatch(IRecord record)
at Microsoft.Ceres.Evaluation.Engine.ErrorHandling.HandleExceptionRecordSetSink.DoWithoutTryCatch(IRecord record)
at Microsoft.Ceres.Evaluation.Processing.Producers.ProducerRecordSetSink.PushToOutput()
at Microsoft.Ceres.Evaluation.Processing.Producers.ProducerRecordSetSink.Put(IRecord record)
at Microsoft.Ceres.Evaluation.Engine.ErrorHandling.HandleExceptionRecordSetSink.DoWithoutTryCatch(IRecord record)
at Microsoft.Ceres.ContentEngine.Processing.Events.EventRecordSetSink.Put(IRecord record)
at Microsoft.Ceres.Evaluation.Engine.ErrorHandling.HandleExceptionRecordSetSink.DoWithoutTryCatch(IRecord record)
at Microsoft.Ceres.Evaluation.Processing.Producers.ProducerRecordSetSink.PushToOutput()
at Microsoft.Ceres.Evaluation.Processing.Producers.ProducerRecordSetSink.Put(IRecord record)
at Microsoft.Ceres.Evaluation.Engine.ErrorHandling.HandleExceptionRecordSetSink.DoWithTryCatch(IRecord record)