Discussion:
Debugging fuzzy searches
Sebastian Hagedorn
2018-05-25 13:59:06 UTC
Permalink
Hi,

now that I know that in current releases of Cyrus 3 is only used for fuzzy
searches, I wanted to experiment with them, but something isn't right:

C SEARCH BODY "Jahren"
* SEARCH 20 77 211 224 255 264 331 351 368 383 404 436 438 439 484 551 568
596 738 754 757 760 761 773 780 810 840 850 892 918 939 946 973 991 1000
1005 1042 1094 1105 1118 1119 1151 1159 1223 1272 1280 1299 1311 1318 1337
1349 1352 1366 1375 1378 1385 1417 1422 1423 1431 1436 1445 1446 1459 1460
1465 1469 1475 1476 1481 1485 1489 1490 1493 1494 1501 1503 1510 1514 1524
1551 1557 1559 1561 1567 1573 1575 1577 1578 1607 1611 1614 1621 1622 1624
1629 1633 1637 1638 1643 1647 1648 1649 1653 1673 1676 1677 1682 1683 1686
1687 1688 1706 1708 1711 1712 1715 1721 1722 1724 1725 1726 1731 1735 1748
1749 1753 1756 1759 1760 1761 1762 1766 1779 1780 1803 1805 1806 1807 1809
1810 1812 1903 1904
C OK Completed (144 msgs in 0.650 secs)
D SEARCH FUZZY BODY "Jahren"
* SEARCH
D OK Completed (0 msgs in 0.000 secs)

The xapian index for the user exists, and I can see in strace that the
imapd process accesses it. What is going on? This is 3.0.6 ...
--
.:.Sebastian Hagedorn - Weyertal 121 (GebÀude 133), Zimmer 2.02.:.
.:.Regionales Rechenzentrum (RRZK).:.
.:.UniversitÀt zu Köln / Cologne University - ✆ +49-221-470-89578.:.
Robert Stepanek
2018-05-25 14:36:02 UTC
Permalink
Does any of the message returned by a regular SEARCH contain the verbatim word "Jahren"? The issue you are describing might be related to stemming.

Cheers,
Robert
Post by Sebastian Hagedorn
Hi,
now that I know that in current releases of Cyrus 3 is only used for fuzzy
C SEARCH BODY "Jahren"
* SEARCH 20 77 211 224 255 264 331 351 368 383 404 436 438 439 484 551 568
596 738 754 757 760 761 773 780 810 840 850 892 918 939 946 973 991 1000
1005 1042 1094 1105 1118 1119 1151 1159 1223 1272 1280 1299 1311 1318 1337
1349 1352 1366 1375 1378 1385 1417 1422 1423 1431 1436 1445 1446 1459 1460
1465 1469 1475 1476 1481 1485 1489 1490 1493 1494 1501 1503 1510 1514 1524
1551 1557 1559 1561 1567 1573 1575 1577 1578 1607 1611 1614 1621 1622 1624
1629 1633 1637 1638 1643 1647 1648 1649 1653 1673 1676 1677 1682 1683 1686
1687 1688 1706 1708 1711 1712 1715 1721 1722 1724 1725 1726 1731 1735 1748
1749 1753 1756 1759 1760 1761 1762 1766 1779 1780 1803 1805 1806 1807 1809
1810 1812 1903 1904
C OK Completed (144 msgs in 0.650 secs)
D SEARCH FUZZY BODY "Jahren"
* SEARCH
D OK Completed (0 msgs in 0.000 secs)
The xapian index for the user exists, and I can see in strace that the
imapd process accesses it. What is going on? This is 3.0.6 ...
--
.:.Sebastian Hagedorn - Weyertal 121 (Gebäude 133), Zimmer 2.02.:.
.:.Regionales Rechenzentrum (RRZK).:.
.:.Universität zu Köln / Cologne University - ✆ +49-221-470-89578.:.
----
Cyrus Home Page: http://www.cyrusimap.org/
List Archives/Info: http://lists.andrew.cmu.edu/pipermail/info-cyrus/
https://lists.andrew.cmu.edu/mailman/listinfo/info-cyrus
+ Attachment1.2
1k (application/pgp-signature)
----
Cyrus Home Page: http://www.cyrusimap.org/
List Archives/Info: http://lists.andrew.cmu.edu/pipermail/info-cyrus/
To Unsubscribe:
https://lists.andrew.cmu
Sebastian Hagedorn
2018-05-25 15:30:08 UTC
Permalink
Post by Robert Stepanek
Does any of the message returned by a regular SEARCH contain the verbatim
word "Jahren"?
Yes
Post by Robert Stepanek
The issue you are describing might be related to stemming.
Doesn't look like it:

D SEARCH FUZZY BODY "Jahr"
* SEARCH
D OK Completed (0 msgs in 0.000 secs)

I have tried various searches, and non seemed to work. Maybe something
isn't set up right, but how can I see what the problem is?

Thanks
Sebastian
Post by Robert Stepanek
Post by Sebastian Hagedorn
Hi,
now that I know that in current releases of Cyrus 3 is only used for
fuzzy searches, I wanted to experiment with them, but something isn't
C SEARCH BODY "Jahren"
* SEARCH 20 77 211 224 255 264 331 351 368 383 404 436 438 439 484 551
568 596 738 754 757 760 761 773 780 810 840 850 892 918 939 946 973 991
1000 1005 1042 1094 1105 1118 1119 1151 1159 1223 1272 1280 1299 1311
1318 1337 1349 1352 1366 1375 1378 1385 1417 1422 1423 1431 1436 1445
1446 1459 1460 1465 1469 1475 1476 1481 1485 1489 1490 1493 1494 1501
1503 1510 1514 1524 1551 1557 1559 1561 1567 1573 1575 1577 1578 1607
1611 1614 1621 1622 1624 1629 1633 1637 1638 1643 1647 1648 1649 1653
1673 1676 1677 1682 1683 1686 1687 1688 1706 1708 1711 1712 1715 1721
1722 1724 1725 1726 1731 1735 1748 1749 1753 1756 1759 1760 1761 1762
1766 1779 1780 1803 1805 1806 1807 1809 1810 1812 1903 1904
C OK Completed (144 msgs in 0.650 secs)
D SEARCH FUZZY BODY "Jahren"
* SEARCH
D OK Completed (0 msgs in 0.000 secs)
The xapian index for the user exists, and I can see in strace that the
imapd process accesses it. What is going on? This is 3.0.6 ...
--
Sebastian Hagedorn - Weyertal 121, Zimmer 2.02
Regionales Rechenzentrum (RRZK)
UniversitÀt zu Köln / Cologne University - Tel. +49-221-470-89578
Loading...