clear query| facets| time Search criteria: .   Results from 1 to 10 from 528 (0.0s).
Loading phrases to help you
refine your search...
[LUCENE-8564] Make it easier to iterate over graphs in tokenstreams - Lucene - [issue]
...We have a number of TokenFilters that read ahead in the token stream (eg synonyms, shingles) and ideally these would understand token graphs as well as linear streams.  FixedShingleFilt...
http://issues.apache.org/jira/browse/LUCENE-8564    Author: Alan Woodward , 2018-11-13, 23:14
[LUCENE-8497] Rethink multi-term analysis handling - Lucene - [issue]
...The current framework for handling term normalisation works via instanceof checks for MultiTermAwareComponent and casts.  MultiTermAwareComponent itself deals in AbstractAnalysisComponents, ...
http://issues.apache.org/jira/browse/LUCENE-8497    Author: Alan Woodward , 2018-11-06, 16:12
Welcome Tim Allison as a Lucene/Solr committer - Lucene - [mail # dev]
...Congratulations and welcome, Tim!> On 2 Nov 2018, at 16:20, Erick Erickson  wrote:> > Hi all,> > Please join me in welcoming Tim Allison as the latest Lucene/Solr committ...
   Author: Alan Woodward , 2018-11-02, 17:55
Welcome Gus Heck as Lucene/Solr committer - Lucene - [mail # dev]
...Congratulations and welcome!> On 1 Nov 2018, at 12:22, David Smiley > wrote:> > Hi all,> > Please join me in welcoming Gus Heck as the latest Lucene/Solr committer! > &g...
   Author: Alan Woodward , 2018-11-01, 13:17
[LUCENE-8509] NGramTokenizer, TrimFilter and WordDelimiterGraphFilter in combination can produce backwards offsets - Lucene - [issue]
...Discovered by an elasticsearch user and described here: https://github.com/elastic/elasticsearch/issues/33710The ngram tokenizer produces tokens "a b" and " bb" (note the space at the beginn...
http://issues.apache.org/jira/browse/LUCENE-8509    Author: Alan Woodward , 2018-10-29, 16:21
Shingles vs phrases for index size - ElasticSearch - [mail # user]
...It depends on the output of your analysis chain.  If you've configured things to produce shingles, then a query like "global warming" will get analyzed to a single term and so will prod...
   Author: Alan Woodward , 2018-10-15, 08:31
[LUCENE-8516] Make WordDelimiterGraphFilter a Tokenizer - Lucene - [issue]
...Being able to split tokens up at arbitrary points in a filter chain, in effect adding a second round of tokenization, can cause any number of problems when trying to keep tokenstreams to con...
http://issues.apache.org/jira/browse/LUCENE-8516    Author: Alan Woodward , 2018-10-04, 12:33
[GitHub] lucene-solr issue #328: SOLR-12034 - Lucene - [mail # dev]
...See LUCENE-8497 for more details.  Mayya would like to replace the marker interface with type-safe methods on CharFilterFactory and TokenFilterFactory> On 1 Oct 2018, at 16:15, Erick...
   Author: Alan Woodward , 2018-10-01, 16:02
[LUCENE-8373] Move ENGLISH_STOP_WORD_SET from StandardAnalyzer to EnglishAnalyzer - Lucene - [issue]
...Follow-up of LUCENE-7444.  English stopwords should be on the EnglishAnalyzer....
http://issues.apache.org/jira/browse/LUCENE-8373    Author: Alan Woodward , 2018-09-24, 08:03
[LUCENE-8395] WordDelimiterGraphFilter can incorrectly add holes to a TokenStream - Lucene - [issue]
...If a token consists entirely of delimiter characters, then WordDelimiterGraphFilter will remove the token and insert a hole into the TokenStream.  However, it does this even if preserve_orig...
http://issues.apache.org/jira/browse/LUCENE-8395    Author: Alan Woodward , 2018-09-24, 08:03