Access Keys:
Skip to content (Access Key - 0)
My Area (Access Key - 2)


Toggle Sidebar
Your Rating: Results: PatheticBadOKGoodOutstanding! 2 rates
Labels:
00code00 00code00 Delete
Enter labels to add to this page:
Please wait 
Looking for a label? Just start typing.

Community repository of Primo misspelling files

Tags:
Last Updated: Feb 16, 2012 01:36


  • Description

    This extendes the default Primo misspelling file, adds all the misspellings listed in this Wikipead page:http://en.wikipedia.org/wiki/Wikipedia:Lists_of_common_misspellings/For_machines and provides a way to add your own custom misspellings


  • Author: Justin Kelly
  • Additional author(s):
  • Institution: Swinburne University of Technology
  • Year: 2012
  • License: BSD style
  • Short description: Use, modification and distribution of the code are permitted provided the copyright notice, list of conditions and disclaimer appear in all related material.
  • Link to terms: [Detailed license terms]
  • Skill required for using this code:
     basic

What

This extendes the default Primo misspelling file, adds all the misspellings listed in this Wikipead page:http://en.wikipedia.org/wiki/Wikipedia:Lists_of_common_misspellings/For_machines and provides a way to add your own custom misspellings

Why

Primo comes with only 130 misspellings and if you review your primo search queries logs you'll see a lot of common misspellings that aren't covered by Primo. By merging in the Wikipedia source this brings it to over 4,000 and by combining your own misspellings file you can customise this as much as you want

State

Stable

Programming language

Perl, text files

Screen captures

Homepage

https://github.com/justinkelly/primo-misspell

Download

All code and misspellings files can be downloaded from : https://github.com/justinkelly/primo-misspell

Working example

Search for yotube at librarysearch.swinburne.edu.au it will return youtube

Adding new misspellings

You can add new misspellings into the custom_misspell.txt file in the Primo format - such as 

Where the incorrect spelling is the first word followed by a space then the corret spelling. Note the or in the alot statement, this allows you to define multiple spelling results

I've added a sample into the custom_misspell.txt for a common misspelling of our Instituion name

How to run

You will then be asked for which version of Primo you are using, Choose one that corresponds to an available sub-folder ie. 3.1.2

This will now convert the contents of the wikipedia_misspell.txt into Primo misspelling file formatting and merge it with the default Primo eng_misspell.txt file (located in the version sub-folder) and then sort this

Copy the newly prioducted $version/misspell_eng.txt to the Primo misspelling directory at ng/jaguar/home/profile/analysis/misspelling/

Extending to other versions of Primo

If you want to create a misspellings file for a version of Primo not listed just create a subfolder with the version number ie 3.1.0, copy the current primo misspelling file (located at: ng/jaguar/home/profile/analysis/misspelling/misspell_eng.txt) to the version folder

Add/Edit the wikipedia_misspell.txt file if you wish

Then run the ./merge_primo_misspell.pl and upload the new file

Page Attachments

No files have been attached to this page.

Added by Justin Kelly on Feb 16, 2012 01:03, last edited by Justin Kelly on Feb 16, 2012 01:36

Adaptavist Theme Builder Powered by Atlassian Confluence