Differences

This shows you the differences between two versions of the page.

Link to this comparison view

Both sides previous revision Previous revision
reloaded:be_solr [2018/04/09 20:04]
giancarlo old revision restored (2017/03/27 12:55)
reloaded:be_solr [2018/04/09 21:39] (current)
giancarlo old revision restored (2017/03/27 12:56)
Line 94: Line 94:
 </WRAP> </WRAP>
 **Stopwords and delimiter** **Stopwords and delimiter**
 +\\
 In most cases, book language is Italian. In most cases, book language is Italian.
 +<WRAP prewrap center>
 +<code>
 +nano -w /opt/solr/solr/islandora/conf/schema.xml
 +</code>
 +</WRAP>
 +<WRAP prewrap center>
 +<code xml>
 +...
 +    <fieldType name="text_fgs" class="solr.TextField" positionIncrementGap="100">
 +      <analyzer>
 +        <tokenizer class="solr.StandardTokenizerFactory"/>
 +        <filter class="solr.LowerCaseFilterFactory"/>
 +        <filter class="solr.StopFilterFactory" ignoreCase="true" words="stopwordsDC.txt"/>
 +      </analyzer>
 +    </fieldType>
 +...
 +    <fieldType name="text" class="solr.TextField" positionIncrementGap="100">
 +      <analyzer type="index">
 +        <tokenizer class="solr.WhitespaceTokenizerFactory"/>
 +        <filter class="solr.HyphenatedWordsFilterFactory"/>
 +        <filter class="solr.WordDelimiterFilterFactory" generateWordParts="1" generateNumberParts="1" catenateWords="0" catenateNumbers="0" catenateAll="0"
 +                types="wdfftypes.txt"/>
 +        <filter class="solr.StopFilterFactory" ignoreCase="true" words="stopwords.txt"/>
 +        <filter class="solr.LowerCaseFilterFactory"/>
 +        <filter class="solr.RemoveDuplicatesTokenFilterFactory"/>
 +      </analyzer>
 +      <analyzer type="query">
 +        <tokenizer class="solr.WhitespaceTokenizerFactory"/>
 +        <filter class="solr.SynonymFilterFactory" synonyms="synonyms.txt" ignoreCase="true" expand="true"/>
 +        <filter class="solr.WordDelimiterFilterFactory" generateWordParts="1" generateNumberParts="1" catenateWords="0" catenateNumbers="0" catenateAll="0"
 +                types="wdfftypes.txt"/>
 +        <filter class="solr.StopFilterFactory" ignoreCase="true" words="stopwords.txt"/>
 +        <filter class="solr.LowerCaseFilterFactory"/>
 +        <filter class="solr.RemoveDuplicatesTokenFilterFactory"/>
 +      </analyzer>
 +    </fieldType>
 +</code>
 +</WRAP>
 <WRAP prewrap center> <WRAP prewrap center>
 <code> <code>
 
 
reloaded/be_solr.txt ยท Last modified: 2018/04/09 21:39 by giancarlo

Developers: CNR IRCrES IT Office and Library
Giancarlo Birello (giancarlo.birello _@_ ircres.cnr.it) and Anna Perin (anna.perin _@_ ircres.cnr.it)
DigiBess is licensed under: Creative Commons License
Recent changes RSS feed Creative Commons License Valid XHTML 1.0 Valid CSS Driven by DokuWiki
Drupal Garland Theme for Dokuwiki