Recoll

Utilities

Source (link to git-repo or to original if based on someone elses unmodified work): Add the source-code for this project on opencode.net

7
7 .8
Description:

Recoll is a personal full text search tool for Unix/Linux.

It is based on the very strong Xapian backend.

It provides an easy to use, feature-rich interface with a Qt GUI.

Most common document types are supported are supported along with their compressed versions (Text, HTML, PDF, Dvi, PostScript, Openoffice, Lyx, Scribus, Word/Excel/PPT, Abiword, Kword, Wordperfect, RTF, djvu, gaim logs, maildir and mailbox mail folders including attachments, misc media files).

Powerful query facilities are provided from simple keyword entry to assisted boolean query building with proximity clauses, filtering on file types or location. A Xesam-compatible query language also supports field searches, and date filtering.

Multiple character sets are supported. Internal processing and storage uses Unicode UTF-8.

Recoll has few dependancies. No database daemon, Web server, or exotic language/framework is necessary. In the default setup, it only runs on your system when you need it. Indexing can be performed in batch mode or in real time.

Thanks to Xapian, indexing does not tax system resources excessively and searching is very fast.
Last changelog:

6 years ago

Latest 1.19 is 1.19.13: this hopefully fixes the last remaining bug in the multithreading code, which was causing quite rare, but ennoying crashes. You definitely want to upgrade to this version if you are running recoll 1.19.

Release 1.19 brings faster indexing for multiprocessors, new results management features (multiple attachment saves, duplicates listing), advanced search history storage, and other performance and usability enhancements. Also, a nice new PPT filter, Python 3 compatibility, and, for Ubuntu Users a Scope for the Dash on Saucy and Trusty.

Release 1.18.1 brings optional case- and diacritics-sensitive searches, complex search history, direct access to hit pages for PDF documents.

Release 1.17.3 brings a number of usability improvement: management of indexing operations from the GUI, filtering on file size, extended directory filtering, Ubuntu Unity Lens, thumbnails in result lists, Okular notes and Gnumeric filters, etc.

Release 1.16.2 brings a long list of small improvements and bug fixes. Image previews, negative directory filtering, anchored searches, more popup menu entries, etc. Please check the release notes for details (http://www.recoll.org/release-1.16.html).

Release 1.15 (.9): Enhanced native Qt 4 user interface (no more Qt 3 compatibility). Switchable table-like display for the results. Direct access to sort functions. Negative directory filtering. Web archive formats.

Release 1.14 (.3): Modification date searches and filtering. New GNU info filter. Improved Thunderbird mail indexing. Other small bug fixes. date searches and filtering, arbitrary email header indexing, new audio tag extractor based on the Mutagen Python library, and miscellaneous other improvements.

Release 1.13 (.04): New class of persistent filters and indexed file types: zip, chm, ics. Improved big text files handling, Firefox visited pages indexing. Quite a few other performance and usability improvements.

Release 1.12: new KDE KIO slave module, collapsing of identical results, context-sensitive F1 help, saving email attachments and other embedded documents to files, and other small improvements and bug fixes.

Release 1.11: easy filtering of results by document type, nicer previews which use html when possible, python programming interface for indexing and searching, better support for the Xesam user query language, new filter framework, better support for arbitrary field indexing and searching.

Release 1.10:
- Created mailing-list to improve support. Check home page.
- Fixed openSuse 11 compile issues.
- Fixed bug in interpreting email mime structure, which resulted in base-64 decoding errors.
- Fixed "Prev" button in preview window. Would actually go forward when walking the search terms.
- Allow setting the highlight color for search terms in result list and preview
- Added svg filter
- Ensure that in case the data of a file can't be indexed because of some error, at least the file name is indexed.
- Improve query language to support OR queries of terms with field specifications (ie: title:someterm OR author:someauthor).
- Fix filename search to split patterns on white space, so that a "*.jpg *.jpeg" search does what's expected. Means you now need to use double-quotes if there is actual embedded white space.
- Jump directly to the external editor choice dialog instead of opening preferences when an external viewer is not found.
- Allow stopping indexing through menu action (only works with qt4 for now).
- Create an "indexedmimetypes" configuration variable to allow explicitely restricting the file types which do get indexed.
- Adds support for CJK text, and a GUI configuration tool for the main configuration file.

Release 1.9: This release brings a number of small practical improvements: new filters: Wordperfect, Abiword, Kword, jpeg, flac, ogg; better control of disk and memory usage during indexing; improved abstract generation; arbitrary field support; improved qt4 support; and miscellaneous user interface improvements and bug fixes, described in more detail in the Changes file.

C

medoc

10 years ago

Parsing recollq text output from a C++ program must not be fun, there would be a point in using the internal APIS. If you just look at the code in recollq.cpp (or the Python C++ module), you get a relatively safe/stable api because I'd have to change recollq + the php/python/kioslave and whatever other modules if I touched it.

I'll try the recoll runner !

You can get in touch with me at jfd at recoll. org if you want.

Report

veeall

10 years ago

Can it be made to index contents of Konquerors web archives(*.war)?

Great software, btw! Thanks!

Report

molecule-eye

10 years ago

I just installed from the Kubuntu PPA Recoll 1.13.01 (and Xapian 1.0.17) and the interface appears to be Qt3, which obviously doesn't match with the rest of my KDE4.3 desktop. I notice on Recoll's website that it uses Qt4 (or CAN) and there are even pretty screenshots of a Qt4-looking Recoll. So why is mine so ugly?

Report

C

medoc

10 years ago

Sorry I did not answer this, I don't seem to get email when comments are added. I think that the current PPA versions for recent Ubuntu releases use qt4. If the package for your release is based on qt3 (I'm a bit too conservative sometimes), you'll have to build from source.

Cheers,

jf

Report

Okanda

11 years ago

Great application, thank you very much :)

Report

9

google01103

13 years ago

kde integration thru a kio slave would be a real nice addition

Report

9

google01103

12 years ago

any chance?

Report

C

medoc

11 years ago

There is a KIO slave in the current version. I don't find it extremely useful, but I'd gladly welcome comments (email me at jfd@recoll.org).

You need to build from source, the instructions for building the slave are inside the source tree:
kde/kio/recoll/00README.txt

Report

9

google01103

11 years ago

fyi I've compiled it and find it useful

Report

hasinasi

13 years ago

I've recently changed from Windows to Kubuntu. I tried beagle, I tried strigi, I tried google desktop search. Non of the search tools were even close to what I wanted.
I was so frustrated that I already tried to run old versions of CopernicDesktopSearch (the tool I liked best when I ran Windows) in Wine. But now the pain is over! Recoll is so much better
than any of those other solutions! I haven't used it a lot, but it looks extremely powerful, so far! Give it a try.

Report

lans

13 years ago

I can't open the homepage of recoll , who can email its source package to me ,thank you .

lenin.lee[at]gmail.com

Report

ahorseman

13 years ago

I found this after finally giving up on trying to maintain a stable copy Beagle on Slackware 10.2 and I wish I'd found it sooner.
Recoll does everything I want a desktop search tool to do. The GUI is a little unusual but effective, I especially like the advanced search. The database footprint is small and the search is surprisingly fast. I couldn't get the realtime monitoring to work but that doesn't matter as a daily automatic update is enough.
The user manual is good, unlike some projects it tells me how to use the app not just how to build it.
On the subject of building from source, there are dependencies but they are only one level, the links are listed on the web page, and they work. Not the dependency hell of Beagle
I'm using this to manage all the pdfs of papers I'm collecting during my PhD. It means I can just roughly sort them into directories and use Recoll to find the right one (and some I hadn't thought of as relevant).

Thank you for an excellent app and please keep up the good work.

Report

arandamandi

13 years ago

I use --enable-qt4
now works :)

Report

arandamandi

13 years ago

make[1]: Entering directory `/home/giuseppe/Desktop/recoll-1.8.1/recoll-1.8.1/qtgui'
/opt/qt4/bin/uic advsearch.ui -o .ui/ui_advsearch.h
uic: File generated with too old version of Qt Designer


:(

Qt4-4.2.2
gcc-4
slackware-current

Report

arandamandi

13 years ago

This is a very GOOD
app :)
Fast and simple
Hope will add preview for video files
and integration kio in kde ;)

Report

mcz

13 years ago

I found Recoll on the net for etwa one week and I compile it on my KUbuntu-64bit.
At the time I had to change a filter (rclsof) file with the correct one present on the program's site, nedeed to index the open office files (I don't know if this correct file is now in the package).
I run the programm intensively up today and I remain very impressed for its performance.

I used other dearch tools (Beagle, Strigi, etc,) in the past, but I found this one really superior.

I appreciate the time it needs to index my data (etwa 50.000 files for 13 GB disk space in 30 minutes).
I appreciated the powerful of the query facilities (no more hundreds of file in the result's list).
I appreciate the possibility to run it with or whithout a daemon.

I liked it so much that I translated the english Manual (you can find on the site a link to the italian translation).

Really a good,good work!

Thank you.

Report

6 years ago

Latest 1.19 is 1.19.13: this hopefully fixes the last remaining bug in the multithreading code, which was causing quite rare, but ennoying crashes. You definitely want to upgrade to this version if you are running recoll 1.19.

Release 1.19 brings faster indexing for multiprocessors, new results management features (multiple attachment saves, duplicates listing), advanced search history storage, and other performance and usability enhancements. Also, a nice new PPT filter, Python 3 compatibility, and, for Ubuntu Users a Scope for the Dash on Saucy and Trusty.

Release 1.18.1 brings optional case- and diacritics-sensitive searches, complex search history, direct access to hit pages for PDF documents.

Release 1.17.3 brings a number of usability improvement: management of indexing operations from the GUI, filtering on file size, extended directory filtering, Ubuntu Unity Lens, thumbnails in result lists, Okular notes and Gnumeric filters, etc.

Release 1.16.2 brings a long list of small improvements and bug fixes. Image previews, negative directory filtering, anchored searches, more popup menu entries, etc. Please check the release notes for details (http://www.recoll.org/release-1.16.html).

Release 1.15 (.9): Enhanced native Qt 4 user interface (no more Qt 3 compatibility). Switchable table-like display for the results. Direct access to sort functions. Negative directory filtering. Web archive formats.

Release 1.14 (.3): Modification date searches and filtering. New GNU info filter. Improved Thunderbird mail indexing. Other small bug fixes. date searches and filtering, arbitrary email header indexing, new audio tag extractor based on the Mutagen Python library, and miscellaneous other improvements.

Release 1.13 (.04): New class of persistent filters and indexed file types: zip, chm, ics. Improved big text files handling, Firefox visited pages indexing. Quite a few other performance and usability improvements.

Release 1.12: new KDE KIO slave module, collapsing of identical results, context-sensitive F1 help, saving email attachments and other embedded documents to files, and other small improvements and bug fixes.

Release 1.11: easy filtering of results by document type, nicer previews which use html when possible, python programming interface for indexing and searching, better support for the Xesam user query language, new filter framework, better support for arbitrary field indexing and searching.

Release 1.10:
- Created mailing-list to improve support. Check home page.
- Fixed openSuse 11 compile issues.
- Fixed bug in interpreting email mime structure, which resulted in base-64 decoding errors.
- Fixed "Prev" button in preview window. Would actually go forward when walking the search terms.
- Allow setting the highlight color for search terms in result list and preview
- Added svg filter
- Ensure that in case the data of a file can't be indexed because of some error, at least the file name is indexed.
- Improve query language to support OR queries of terms with field specifications (ie: title:someterm OR author:someauthor).
- Fix filename search to split patterns on white space, so that a "*.jpg *.jpeg" search does what's expected. Means you now need to use double-quotes if there is actual embedded white space.
- Jump directly to the external editor choice dialog instead of opening preferences when an external viewer is not found.
- Allow stopping indexing through menu action (only works with qt4 for now).
- Create an "indexedmimetypes" configuration variable to allow explicitely restricting the file types which do get indexed.
- Adds support for CJK text, and a GUI configuration tool for the main configuration file.

Release 1.9: This release brings a number of small practical improvements: new filters: Wordperfect, Abiword, Kword, jpeg, flac, ogg; better control of disk and memory usage during indexing; improved abstract generation; arbitrary field support; improved qt4 support; and miscellaneous user interface improvements and bug fixes, described in more detail in the Changes file.

12345678910
114
product-maker darthroe May 06 2014 9 excellent
product-maker sealbhach Dec 14 2012 9 excellent
product-maker dmeyer Oct 14 2012 9 excellent
product-maker remix Apr 03 2012 9 excellent
product-maker paulus3005 Mar 24 2012 3 bad
product-maker cjann Dec 29 2011 9 excellent
product-maker seaman123 Sep 23 2011 9 excellent
product-maker kerenskyy May 29 2011 9 excellent
product-maker lazx888 May 04 2011 9 excellent
product-maker google01103 Mar 05 2011 9 excellent
product-maker groo Feb 17 2011 9 excellent
product-maker yuksing Feb 04 2011 9 excellent
product-maker Alesvol Feb 03 2011 9 excellent
product-maker LazyKent Feb 02 2011 9 excellent
product-maker jamjam Nov 24 2010 9 excellent
Be the first to comment
omiliya
Nov 30 2013
remix
Apr 03 2012
stalin2000
Oct 02 2011
kerenskyy
May 29 2011
groo
Feb 17 2011
tittiatcoke
Sep 19 2010
cjann
Mar 02 2010
File (click to download) Version Description PackagetypeArchitectureRelease Channel Downloads Date Filesize DL OCS-Install MD5SUM
*Needs pling-store or ocs-url to install things
Pling
0 Affiliates
Details
license
version
1.19.13
updated May 06 2014
added Jan 29 2007
downloads 24h
0
mediaviews 24h 0
pageviews 24h 3
System Tags app software