Simple and Advanced Sorting

November 12th, 2007

One of the last big features for version 1 of WeSay has been in a for while.  Someone (I won’t mention any names) did a great job on it but didn’t blog about it.  So I’ll see if I can do it justice.

In this screen shot we see the three ways you can now specify sorting:

2007-11-09_16-45-54-413

Sort like another language

If the text sorts just like some major language, just select that language in the list and you’re done.

Custom Simple

Many languages based on Latin characters introduce a small number of "special characters" used to represent sounds not covered by A-Z, like a barred i. In these situations, you can specify the rules just like you do in many existing apps, like Toolbox and Lexique Pro. When you choose "custom simple", the rules box is filled with rules needed to sort English. You can enter vernacular works in the "Test Sort" area:

2007-11-09_16-56-07-999

We want the barred-i to sort just after i, so we add it to the rules and click the button:

2007-11-09_16-58-41-205

Normally, these secondary distinctions are enough.  But for some languages, tertiary distinctions are needed. We get these in the simple rules by using parentheses. Consider this list of words:

2007-11-09_17-12-49-809

Now, imagine we want the upper-case words to sort together.  We need to add in another level of distinction, so that case can trump the accents.  We do this by adding parentheses around all case pairs, and putting the two sets of e’s on the same line:

2007-11-09_17-17-23-560

Eric has written up the details on our wiki.

Custom ICU rules

For languages that need them, WeSay also supports ICU tailorings, which look like this:

& C < č <<< Č < ć <<< Ć   –for Serbian (Latin) or Croatian

Like many features of WeSay, this simple-to-advanced collating actually lives in our "Palaso Library", which is of course open-source and can be included in other programs.  Thus we foresee a day soon when the setup you do in one program (e.g. WeSay) will be trivially usable in other language-development tools.

Happy sorting!

Configuration Tool Remodel

September 21st, 2007

One of the benefits of being stuck at home recovering from Dengue Fever is I feel like it’s ok to splurge a bit on stuff that I want to do, rather than have to do.

For example, while we have always put a lot of effort into keeping WeSay clean and easy, its companion WeSay Configuration Tool has always been about being just-functional-enough-to-get-by.

So I spent two days remodeling it, trying to get it clean and easier to navigate. Here’s the before:

(click to see full size)

confighomepageb4.png

and after:

2007-09-21-17-06-48-750.png

Yes, lacking any artistic ability myself, I’ve ripped off a bunch of open-source Tango icons, tweaked them with the indispensable InkScape, and otherwise copied the latest Skype’s options dialog. There are still quite a number of rough edges in there, but this was a good step.

Pictures in dictionary entries

September 19th, 2007

I’ve been working on simple support for pictures in WeSay:

When you choose a picture, it is copied to the “pictures” sub-folder of the WeSay folder.  So everything stays together.

Some more things to do someday:

1) reduce the picture size, if we can determine that this is a good thing to do. This would make backing up a lot less expensive.

2) Show the pictures in the article preview (top of the window) and add it to the html output.