SFX Target for ticTOCs' feed URLs
Please note I: This page is under construction and only provides a summary of our current findings, without guaranteeing completeness or correctness. Feel free to add your comments or to modify the text.
Please note II: Meanwhile, the journalTOCs project provides an API for the feed URLs collected by ticTOCs. Thus it would be possible to implement a plugin instead of loading the object portfolios.
Idea and Example
The ticTOCs project collected more than 12,000 journal feeds from over 400 publishers and offers an intuitive web interface on top. The project team shares the source data with the community, so the information can be loaded into the SFX knowledge base to offer a link to a specific journal feed via the SFX service menu.
Example: http://tinyurl.sfx.mpg.de/q4r3
Blog Post: http://blog.vlib.mpg.de/some-appetite-for-table-of-contents-feeds/
Prepare the target setup
- Connect to SFX admin
- Go to the "Targets" section and add a new target with the following information:
- Target: TICTOCS_LCL
- Public Name: ticTOCs
- Services: getTOC or getCitedJournal (the latter requires a minor modification of the Bulk::BULK target parser to support the method)
- Modify the corresponding target service by adding:
- Public Name: RSS feed for table of contents via
- Parser: Bulk::BULK
- [optional] Displayer: WWW::TICTOC (only required to display the RSS button right after the target name)
- Object Lookup: Yes
- [optional] Upload the following files to your server (only required to display the RSS button right after the target name)
- TICTOC.pm - to location sfx_version3/[SFX_INSTANCE]/lib/Parsers/TargetDisplayer/WWW
- get_target_name_TICTOC.tmpl - to location: sfx_version3/[SFX_INSTANCE]/templates/sfxmenu/services
- rss.png - to location: sfx_version3/[SFX_INSTANCE]/templates/img/sfxmenu
Notes
- if you decided to choose the getCitedJournal service (as we have), you need to localize your Bulk::BULK target parser by adding:
-> we submitted this change as an enhancement to ExLibris
Load the ticTOCs data
- Download the data file from http://www.tictocs.ac.uk/text.php
- Open the file in a spreadsheet or editor tool (like Excel) and make the following modifications:
- in the feed URL column, replace all "&" with "%26"
- add the string "jkey=" to all feed URLs, e.g. by replacing "http" with "jkey=http"
- copy the eISSN value to the ISSN cell if the ISSN cell is empty
- remove all entries without a value in the ISSN cell
- move the ISSNs to the first column
- move the journal feed to the second column
- save the file
- Select the "DataLoader" tool in the SFX admin
- Select Target + Service: TICTOCS_LCL
- Specify input file name: [SFX:your file]
- Column 1 (Primary Key): ISSN
- Column 2: PARSE_PARAM
- Load Type: Add Portfolios
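The manual modifications to the data file could also be scripted. The following is only a minimal Python sketch, not the procedure we used. It assumes the downloaded file is tab-separated with the columns ID, title, feed URL, ISSN and eISSN in that order (please check this against your own download); the function name `to_loader_rows` is hypothetical.

```python
import csv


def to_loader_rows(lines):
    """Yield (ISSN, PARSE_PARAM) pairs for the DataLoader input file.

    Assumption: each input line is tab-separated as
    ID, title, feed URL, ISSN, eISSN.
    """
    # QUOTE_NONE keeps quotation marks in journal titles untouched.
    reader = csv.reader(lines, delimiter="\t", quoting=csv.QUOTE_NONE)
    for row in reader:
        if len(row) < 4:
            continue  # skip malformed or empty lines
        feed_url = row[2].strip()
        issn = row[3].strip()
        eissn = row[4].strip() if len(row) > 4 else ""
        # Copy the eISSN to the ISSN cell if the ISSN is empty.
        if not issn:
            issn = eissn
        # Remove entries that still have no ISSN.
        if not issn:
            continue
        # Replace "&" with "%26" and add the "jkey=" string.
        parse_param = "jkey=" + feed_url.replace("&", "%26")
        # ISSN goes to the first column, the feed to the second.
        yield issn, parse_param
```

The resulting pairs can then be written out as a two-column, tab-separated file (for example with `csv.writer(..., delimiter="\t")`) and passed to the DataLoader as described above.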
Notes
- While modifying the data file you may notice that quite a lot of journals lack an ISSN. You may use the "Lookup Tool" to improve the coverage, but this will require some manual checking. We communicated our findings to the ticTOCs team, which will hopefully result in more ISSNs in the data file.
- For some ISSNs more than one journal feed URL is available, e.g.
1 Nature http://www.nature.com/nature/current_issue/rss 0028-0836 1476-4679
15721 Nature AOP http://www.nature.com/nature/journal/vaop/ncurrent/rss.rdf 0028-0836 1476-4679
SFX will only create one object portfolio per ISSN (using the information from the first entry). Therefore, we decided to delete duplicate ISSNs from our loader file.
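Removing the duplicate ISSNs can be sketched in a few lines of Python. This is an illustration under assumptions, not our actual procedure: the rows are taken to be (ISSN, feed parameter) pairs in file order, and the function name `drop_duplicate_issns` is hypothetical. It keeps the first entry per ISSN, which matches the entry SFX would use anyway.

```python
def drop_duplicate_issns(rows):
    """Return the rows with only the first entry kept for each ISSN.

    Assumption: rows is an iterable of (issn, parse_param) pairs
    in the same order as the data file.
    """
    seen = set()
    unique = []
    for issn, parse_param in rows:
        if issn in seen:
            continue  # a feed for this ISSN was already kept
        seen.add(issn)
        unique.append((issn, parse_param))
    return unique
```

For the Nature example above, only the first feed (current_issue/rss) would remain in the loader file.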


6 Comments
Sep 03, 2009
Jiri Pavlik
Wonderful, thanks!
Sep 03, 2009
Milan Janíček
Nice work again!
I have tried to automate the creation of the file for DataLoader - the result is here: http://sfx.techlib.cz/sfxlcl3/cgi/public/ticTOC.cgi
It downloads the online file (so you should always be able to download current data), applies the rules by Inga and displays the result... If you find any errors, let me know
Sep 07, 2009
Inga Overkamp
Hi Milan,
and much nicer work done by the Czech, thanks for sharing the location of your CGI! Do you cache the data source or do you download the file on request?
Actually, it shouldn't be very hard to automate the data-loading process from there... Are you considering contributing the code to El Commons?
Sep 11, 2009
Milan Janíček
Hi Inga,
I will try to write an automated script next week... when I have done this I will contribute both the CGI and this script here.
Sep 17, 2009
Milan Janíček
Both scripts are now in attachments section...
ticTOC.pl should be more useful, but some configuration is needed - I think that changing the variables in the first part of the script should be enough (and quite simple). Notice that you can turn on report mode ($updateDatabase=0)
Some more issues:
1) I am not sure what to do about journals that were removed from the ticTOC list - at the moment they are not deactivated - any command line script that deactivates all portfolios in a specified target should solve this, but I am not sure if it is possible
2) I have added a third column - AVAILABILITY - to the file generated by the .pl script for DataLoader - DataLoader was then able to add new portfolios and activate deactivated ones (which may be useful regarding issue 1)
3) I had problems with adding this script as a cron job - I think the problem is that dataloader.pl is not able to get the value of the environment variable SFX_HOME... I haven't found a solution for that
(trying to load .cshrc or to set this variable inside the cron job definition did not help)
May 19, 2011
Kasper Løvschall
Hi all
I've added a code contribution for JournalTOCs using the plugin method as mentioned in note II. Feel free to try it out if you like...
http://exlibrisgroup.org/display/SFXCC/JournalTOCs+as+target+for+SFX+with+plugin+support
Thanks,
Kasper