Differences

This shows you the differences between two versions of the page.

reloaded:be_solr [2017/03/27 12:43] giancarlo
reloaded:be_solr [2017/03/27 12:56] giancarlo
Line 91:
 <copyField source="dc.publisher" dest="dc.publisher_dct"/>
 ...
 +</code>
 +</WRAP>
 +**Stopwords and delimiter**
 +\\
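 +Most of the books in the collection are written in Italian, so the stopword and word-delimiter filters below are configured with Italian text in mind.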
 +<WRAP prewrap center>
 +<code>
 +nano -w /opt/solr/solr/islandora/conf/schema.xml
 +</code>
 +</WRAP>
 +<WRAP prewrap center>
 +<code xml>
 +...
 +    <fieldType name="text_fgs" class="solr.TextField" positionIncrementGap="100">
 +      <analyzer>
 +        <tokenizer class="solr.StandardTokenizerFactory"/>
 +        <filter class="solr.LowerCaseFilterFactory"/>
 +        <filter class="solr.StopFilterFactory" ignoreCase="true" words="stopwordsDC.txt"/>
 +      </analyzer>
 +    </fieldType>
 +...
 +    <fieldType name="text" class="solr.TextField" positionIncrementGap="100">
 +      <analyzer type="index">
 +        <tokenizer class="solr.WhitespaceTokenizerFactory"/>
 +        <filter class="solr.HyphenatedWordsFilterFactory"/>
 +        <filter class="solr.WordDelimiterFilterFactory" generateWordParts="1" generateNumberParts="1" catenateWords="0" catenateNumbers="0" catenateAll="0"
 +                types="wdfftypes.txt"/>
 +        <filter class="solr.StopFilterFactory" ignoreCase="true" words="stopwords.txt"/>
 +        <filter class="solr.LowerCaseFilterFactory"/>
 +        <filter class="solr.RemoveDuplicatesTokenFilterFactory"/>
 +      </analyzer>
 +      <analyzer type="query">
 +        <tokenizer class="solr.WhitespaceTokenizerFactory"/>
 +        <filter class="solr.SynonymFilterFactory" synonyms="synonyms.txt" ignoreCase="true" expand="true"/>
 +        <filter class="solr.WordDelimiterFilterFactory" generateWordParts="1" generateNumberParts="1" catenateWords="0" catenateNumbers="0" catenateAll="0"
 +                types="wdfftypes.txt"/>
 +        <filter class="solr.StopFilterFactory" ignoreCase="true" words="stopwords.txt"/>
 +        <filter class="solr.LowerCaseFilterFactory"/>
 +        <filter class="solr.RemoveDuplicatesTokenFilterFactory"/>
 +      </analyzer>
 +    </fieldType>
 </code>
 </WRAP>
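As a rough illustration of how the text type defined above might be used, the fragment below binds a field to it and summarises the usual format of the two auxiliary files. The field name dc.description_t and its attributes are hypothetical, not taken from the original schema, and the comments describe the general Solr file formats rather than the actual contents of stopwords.txt and wdfftypes.txt on this server.
<WRAP prewrap center>
<code xml>
<!-- Hypothetical field (the name is an assumption, not part of the original schema)
     analysed with the "text" type defined above. -->
<field name="dc.description_t" type="text" indexed="true" stored="true" multiValued="true"/>

<!-- stopwords.txt (read by solr.StopFilterFactory): one stopword per line,
     lines starting with # are comments. For Italian text it would typically
     list words such as: il, lo, la, di, che, e -->

<!-- wdfftypes.txt (read by solr.WordDelimiterFilterFactory): one mapping per line
     in the form  character => TYPE, where TYPE is one of LOWER, UPPER, ALPHA,
     DIGIT, ALPHANUM or SUBWORD_DELIM, for example:  % => DIGIT -->
</code>
</WRAP>
Because the same stopwords.txt and wdfftypes.txt are referenced by both the index and the query analyzer, content indexed before a change to either file would normally need to be reindexed for searches to behave consistently.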
 
 
reloaded/be_solr.txt · Last modified: 2018/04/09 21:39 by giancarlo

Developers: CNR IRCrES IT Office and Library
Giancarlo Birello (giancarlo.birello _@_ ircres.cnr.it) and Anna Perin (anna.perin _@_ ircres.cnr.it)
DigiBess is licensed under: Creative Commons License