Time  Nick     Message
11:16 owen     Yes, it works great. Thank you!
11:16 hdl      ?
11:16 hdl      owen : have you tried my fix.
11:02 kados    paul: yes please :-)
11:01 paul     do youwant a complete french DB ?
11:01 ToinS    Bye all ...
11:01 kados    ok
11:01 paul     don't forget to mail me if you find something interesting even if you don't catch the solution
11:01 kados    ok, I"ll do my best
11:01 paul     otherwise, i'll investigate too.
11:01 paul     OK, 6PM, I hope i'll have a fix for this problem in koha-cvs tomorrow morning ;-)
11:00 kados    yea :-)
11:00 paul     yep. it's the kind of goodies we could add to Koha (like pictures of the core team)
11:00 kados    It think there are some others as well
11:00 kados    it's quite funny :-)
10:59 ToinS    yes !
10:59 paul     yes
10:59 kados    paul: did ToinS discover that? :-)
10:59 kados    paul: :-)
10:59 paul     really LOL !
10:59 paul     kados : did you tried to search /pancakes on team42 interface ?
10:56 kados    paul: I'll install unimarc koha and troubleshoot this
10:55 kados    in fact, I find addbiblio quite confusing
10:55 paul     not well-formed (invalid token) at line 15, column 34, byte 635...
10:54 paul     internal server error when saving !
10:54 kados    ahh
10:53 paul     (although i'm not sure there is no side effect !)
10:52 paul     just comment line 532, and everything is better !
10:52 paul     and line 527/532 => read it again, but FROM XML !
10:52 paul     line 423, if we are in a mod = read biblio from MARCgetbibio
10:51 paul     OK, you were wrong ;-)
10:51 paul     something strange is probably done on line 527+/-
10:48 kados    the page must be build differently for edits than for new records
10:47 kados    agreed, quite strange
10:47 kados    I'm checking
10:47 paul     that's really strange !
10:47 paul     why has the previous page wrong labels ?
10:47 paul     1st question then :
10:47 kados    esp if we agree to only deal with two encodings
10:46 kados    the encoding probs are easier to fix IMO
10:46 paul     html2marc or html2xml ?
10:46 kados    and not have a marc editor incapable of producing valid marc
10:46 paul     which one(s) ?
10:44 paul     :-(((
10:43 paul     [Wed Apr 05 17:43:17 2006] [error] [client 213.41.184.186] not well-formed (invalid token) at line 11, column 19, byte 480 at /usr/lib/perl5/vendor_perl/5.8.6/i386-linux/XML/Parser.pm line 187, referer: http://i8.bureau.paulpoulain.com/cgi-bin/koha/acqui.simple/addbiblio.pl
10:43 paul     and get :
10:43 hdl      owen recommited.
10:42 osmoze   bye all
10:42 kados    each record does not have it's own copy of a label
10:42 kados    but labels are stored in the same place, right?
10:42 kados    wow, that is strange
10:41 paul     (& it works perfectly when you look at a biblio or create a new one)
10:41 paul     yep.
10:41 kados    are you sure you did not convert mysql to have utf-8 in the tables on this test box?
10:40 paul     you'll get correct labels & wrong (very strange !) datas
10:40 paul     hint : on the previous link, force "encoding utf8" in firefox
10:40 kados    in previous version of Koha?
10:39 kados    was char_encode used to dynamically transform descriptions in the editor?
10:39 kados    hmmm ...
10:39 kados    not me
10:39 paul     who transformed descriptions ???
10:38 paul     If you add a new biblio, you'll get empty data, of cours, but VALID labels !
10:38 paul     diacritics are gone for TAG labels as well as datas !
10:38 kados    ahh ... this is not in the _data_ but in the descriptions!
10:38 paul     so :
10:38 paul     L'autorité
10:37 paul     200$a should be
10:37 paul     should be Numéro
10:37 paul     020 tag :
10:37 kados    ok, I'm there
10:37 paul     (test/test should work)
10:37 paul     http://i8.bureau.paulpoulain.com/cgi-bin/koha/acqui.simple/addbiblio.pl?oldbiblionumber=50109&frameworkcode=Default&op=
10:37 paul     so, go to
10:36 kados    finished
10:36 kados    hehe
10:36 paul     (let me know when the history if finished)
10:36 kados    it converts the html form submission into MARCxml
10:36 kados    so i wrote a new method that was easier to read
10:35 kados    and was very difficult to read
10:35 kados    it lost data for instance
10:35 kados    MARChtml2marc was very buggy
10:35 kados    ok ... here's the history
10:35 paul     ??? I don't understand, sorry
10:34 kados    only MARChtml2xml uses XML
10:34 kados    MARCgetbiblio is xml independant
10:34 kados    if you edit or add one with addbiblio, it is utf-8
10:34 paul     yep
10:34 kados    is that because you are editing one you already edited?
10:33 paul     Biblio.pm just call MARCgetbiblio, that is xml independant, right ?
10:33 paul     but... when I reach addbiblio, I immediatly get a problem. html2xml has not be called !
10:33 kados    so the user will never know it was in utf-8
10:33 kados    then, after MARChtml2xml, convert back to 8859-1
10:33 kados    convert to utf-8 in MARChtml2xml at the beginning
10:33 paul     (ok, let's say we have only 2 encodings, that's OK for me !)
10:32 kados    if 8859-1
10:32 kados    check TemplateEncoding var
10:32 kados    our solution could be quite simple
10:31 kados    so if we assume we only must deal with two encodings in 2.4
10:30 kados    because it probably does not exist in UNICODE :-)
10:30 kados     I suspect that MARC-8 was never supported in Koha
10:30 kados    but in fact, Koha already was broken for them with regards to encodings
10:29 kados    no, I must fix this problem
10:29 kados    but I suspect it's a problem Koha has had for some time
10:29 paul     so the new MARC editor don't work for them ?
10:29 kados    no, they still use 8859
10:29 paul     are they in UTF8 ?
10:28 paul     as they have some biblios with diacritics iirc.
10:28 kados    for encoding?
10:28 paul     what do you did for NPL ?
10:27 kados    as it would be hard to confirm it was correct with 50K records
10:27 kados    I am scared to guess the encoding :-)
10:26 paul     scares ?
10:26 kados    I read that as well, it scares me :-)
10:22 kados    (if the record is outside of ascii range?)
10:21 kados    (in 2.2.5)
10:21 kados    so what if a record is not 8859, does it currently display incorrectly when editing?
10:21 paul     yep
10:21 kados    since your templates are 8859?
10:21 kados    so everything is edited as 8859 with your clients?
10:20 kados    right
10:20 paul     for Instance, I just want to solve : editing a record coming from Koha !
10:20 kados    4. record coming from KOha (previously edited/added in Koha)
10:20 kados    3. record coming from Koha (imported with bulkimport)
10:19 kados    2. record coming from resevoir
10:19 kados    1. record coming from Z3950
10:19 kados    paul: we have several cases:
10:19 kados    paul: and for MARC::Record and MARC::File::XML and MARC::Charset :-)
10:19 kados    paul: and with xml parsers for not supporting other encodings
10:18 kados    paul: me too!
10:18 paul     I will really become mad with utf8 & Perl !
10:18 slef     by the way folks, msg me if I seem to be away and I'll see it later
10:17 hdl      hey I'm not dead yet !
10:17 slef     kados: I guess we start with utf-8 in the xml bit and then try to expand it outwards to cover all koha internals.
10:17 paul     kados : 1/2/3 : do you mean we will have to do 1 AND 2 AND 3 everytime ?
10:17 kados    hehe
10:17 paul     hdl : you just loose your commemorative plaque !
10:16 owen     Global symbol "$branchname" requires explicit package name at /home/oleonard/koha/intranet/modules//C4/Auth.pm line 429.
10:16 owen     Global symbol "$branchname" requires explicit package name at /home/oleonard/koha/intranet/modules//C4/Auth.pm line 418.
10:16 owen     Global symbol "$branchname" requires explicit package name at /home/oleonard/koha/intranet/modules//C4/Auth.pm line 414.
10:16 owen     Global symbol "$branchname" requires explicit package name at /home/oleonard/koha/intranet/modules//C4/Auth.pm line 409.
10:16 owen     also:
10:16 paul     so you had better do a left join to get branchname, as it will be called on every page !
10:15 paul     hdl : about your previous commit : left join is faster than where.
10:14 paul     ... thinking...
10:14 kados    paul: do you have any ideas?
10:13 kados    I dont' think any xml parsers recognize marc-8 or some of the other strange encodings in library records
10:12 kados    because MARChtml2xml (which replaces the very buggy MARChtml2marc) requires utf-8
10:12 kados    3. convert back to old encoding after MARChtml2xml
10:12 kados    2. preserve encoding before running MARChtml2xml
10:11 kados    1. check encoding of all records going into editor
10:11 kados    paul: so we could :
10:09 kados    slef: could use some of your input on this as well
10:09 kados    I can reintroduce char_encode, or I can try to build a better method
10:08 kados    so I'm quite puzzled about how to proceed
10:08 kados    right
10:08 paul     (just below "sub create_input")
10:08 kados    ok
10:08 paul     there is one, but it's commented
10:08 kados    paul: and see if modif works
10:08 kados    paul: if there, try removing it
10:07 kados    paul: check addbiblio.pl for an 'encode' line
10:07 kados    right
10:07 hdl      commited in 2.2 branch
10:07 paul     (I did not saw it on add, just on modif !)
10:06 paul     kados : you're right, we have a major problem in MARC editor on default !!!
09:59 kados    heh
09:57 shedges  in French?
09:55 owen     No, but we'll put up a commemorative plaque for hdl ;)
09:54 shedges  ...and I won't be there to get any credit for the improvement :-(
09:53 owen     Thanks, that's going to be a REALLY BIG help for our librarians
09:52 hdl      I do it and commit.
09:52 hdl      I read.
09:51 hdl      owen yes.
09:51 kados    if not, should be fairly painless to add it to userenv
09:50 kados    isn't it just 'branchname'?
09:50 owen     Okay, for whomever: Is there a simple way to pull the branch name at the same time, rather than just the branch code?
09:49 paul     owen : ask hdl, he wrote userenv.
09:49 owen     Paul, I have a question for you about C4::Context->userenv->{branch}
09:49 kados    slef: if you want to bail us out of our encoding probs, please do
09:48 slef     kados: you sure? Even some of our organic flour is drugged already.
09:48 kados    slef: no drugs :-)
09:48 kados    slef: I eat organic flour
09:48 owen     slef: that's why we're all so happy over here
09:48 slef     kados: I thought all flour in the US was drugged already?
09:48 kados    slef: we're drowning in encoding problems
09:47 kados    slef++
09:47 kados    perhaps the solution is to re-encode in the original encoding after using xml
09:47 kados    the problem is that xml must be in utf-8
09:46 kados    but didn't hear anything :-)
09:46 kados    I posted a warning in cvs notes when committing this code
09:46 kados    so yes, even for default I'm afraid
09:45 kados    it's in the MARChtml2xml routine in Biblio.pm
09:45 paul     you mean even for default templates ?
09:45 kados    and then save the record as utf-8
09:45 kados    it may attempt to convert any other char encoding to utf-8 when editing a record
09:44 kados    the character encoding of the new MARC editor is utf-8 by default
09:44 kados    as it relates to the new MARC editor
09:44 kados    which you will want to know about
09:43 kados    this would solve problem #2
09:43 kados    on import
09:43 kados    one idea I had was to encode the entire collection as utf-8
09:43 paul     I think it's the best way to do it.
09:43 paul     you could try to play with Encode module !
09:42 paul     this char_decode causes a lot of problems.
09:42 kados    I'm not even sure how to begin troubleshooting
09:42 paul     yep.
09:42 kados    that are incorrect
09:42 kados    in fact, there are many characters
09:42 kados    is char_decode for MARC-8 data?
09:42 paul     it should be über or Über
09:41 paul     mmm... I bet it's char_decode that is wrong on this specific german char.
09:41 kados    (as wipo is set to 8859)
09:41 kados    or else 8859 is not the proper encoding to display them
09:41 kados    but I believe they are ruined on import
09:41 kados    in 245 $a (title), there are several special characters
09:40 kados    ahh, good idea
09:40 paul     a suggestion (for wipoopac) : the footer would be better with 2 links I think : 1 for koha.org & 1 for liblime.com
09:40 kados    in 245 $a (title)
09:40 kados    http://wipoopac.liblime.com/cgi-bin/koha/opac-MARCdetail.pl?bib=17996
09:40 kados    observe this record:
09:40 kados    but before we discuss that
09:39 kados    for instance, with the marc editor
09:39 kados    we still have some problems in 2.4 related to encoding
09:39 kados    well, wait
09:39 kados    I've got
09:39 kados    another problem
09:39 paul     as it's what existing libraries have.
09:39 kados    absolutely
09:39 paul     but defaulting it to iso8859-1 !
09:39 kados    ok, thanks
09:38 paul     but ok for this patch.
09:38 paul     I've seen that you have it in npl, but it's not in default or css
09:38 paul     (I mean, at least in default templates)
09:38 paul     utf8 is NOT the default for me in 2.4
09:38 paul     back
09:33 kados    hehe
09:33 paul     (on phone again !)
09:33 kados    paul: ?
09:33 kados    for default
09:33 kados    it only takes about 5 mintes to add the template code
09:33 kados    if you want I will commit to 2.4 immediately
09:33 kados    and it works perfectly
09:32 kados    in fact, I've already done this
09:32 kados    <!-- /TMPL_IF -->
09:32 kados    <meta http-equiv="Content-Type" content="text/html; charset=utf-8" />
09:32 kados    <!-- TMPL_ELSE -->
09:32 kados    <meta http-equiv="Content-Type" content="text/html; charset=<!-- TMPL_VAR NAME='TemplateEncoding' -->" />
09:32 kados    <!-- TMPL_IF NAME="TemplateEncoding" -->
09:32 kados    then, in templates, put:
09:32 kados    so why not just have a template syspref called 'TemplateEncoding'
09:31 kados    utf-8 => 8859-1, etc.
09:31 kados    they must edit all template headers to change the value
09:31 kados    but it's a real problem if a library has some other encoding
09:31 kados    in templates
09:31 kados    right now, in 2.4, utf-8 is default
09:31 kados    encoding:
09:27 paul     explain what you're speaking of with Encoding ;-)
09:27 paul     kados : i'm back
08:45 paul     (still on phone)
08:45 paul     yep
08:43 kados    do they like 'subfield reordering' too?
08:42 kados    great! :-)
08:42 paul     IPT thanks a LOT the guy who wrote CloneTag & CloneSubfield !!!
08:42 kados    paul: as they won't need to edit the template directly
08:42 kados    paul: i think utf-8 catalogs will appreciate it
08:41 kados    paul: it took me about 5 minutes to add a TemplateEncoding variable to Auth.pm, and the templates ... I will add it to CVS and to default templates if you ask me too
08:40 kados    paul: well, rel24 now :-)
08:40 kados    paul: in rel22 and head
08:40 kados    paul: also, we must discuss encoding probs when you are available
08:17 kados    paul: that allows the librarian to specify the encoding scheme to be used in intranet and opac templates?
08:16 kados    paul: what do you think about a syspref for 'encoding'
08:16 kados    paul: quick questions when you're off phone
07:59 paul     how many do you charge for this feature ?
07:57 kados    and we can't afford to do it for free :-)
07:57 kados    because we have found everyone wants their Koha tweaked in some way
07:57 kados    if they complain, we drop it :-)
07:57 paul     (because if I don't propose, they don't ask !)
07:57 kados    it's a required item
07:56 paul     they want it, or you propose ?
07:56 kados    yes, most customers have custom css for at least color, some for layout as well
07:56 paul     (/me happy to see that it's wipo ;-) )
07:54 paul     you write a specific css for every customer ?
07:53 kados    which I can view fine in dumpmarc.pl
07:53 kados    ology a commercial success;
07:52 kados    rming your own company; 8. Financing the business start up; 9. Making your techn
07:52 kados     intellectual property; 5. Developing your ideas; 6. The licensing option; 7. Fo
07:52 kados    s commercialisation of your technology; 4. The difficult problem of valuation of
07:52 kados    o market; 2. So do you really have something of value? 3. The first steps toward
07:52 kados    roftHouse, Surrey, UK;  Recd 11/12/98;  Contents : 1. Bringing your technology t
07:52 kados    856    _aPrice : <9C> 36.00;    Inv.# CSI102254;    Date  30/11/98; Supplier : C
07:52 kados    856 contained:
07:52 kados    http://wipoopac.liblime.com/cgi-bin/koha/opac-MARCdetail.pl?bib=24436
07:52 kados    but the data in 856 was discarded by bulkmarcimport
07:52 kados    the record imported fine
07:51 kados    encoded
07:51 kados    and claims to be MARC-8
07:51 kados    yes
07:50 paul     was it iso2790 ?
07:49 kados    :-)
07:49 paul     so, the answer is no ;-)
07:49 kados    just wondering if you had seen it before
07:49 kados    yep, could be
07:46 paul     (thus my idea)
07:45 paul     ("mapping" is sometihng MARC::charset does)
07:45 paul     could it be a MARC::charset problem ?
07:45 paul     I never saw this before.
07:45 paul     "no mapping found at position 8".
07:45 paul     hi kados.
07:41 kados    no mapping found at position 8 in Price : ~\ 36.00;    Inv.# CSI102254;    Date  30/11/98; Supplier : CroftHouse, Surrey, UK;  Recd 11/12/98;  Contents : 1. Bringing your technology to market; 2. So do you really have something of value? 3. The first steps towards commercialisation of your technology; 4. The difficult problem of valuation of intellectual property; 5. Developing your ideas; 6. The licensing option; 7. Forming your own company; 8. Financing the busines
07:41 kados    I get the following error in the log several hundred times:
07:41 kados    paul: with bulkmarcimport
07:41 kados    paul: I'm importing some records and I'm afraid they are truncating
07:40 kados    paul: do you have a quick second?
07:40 kados    morning all
07:17 hdl      salut
07:13 osmoze   hdl ?
04:43 osmoze   (enfin désolé ^^)
04:42 osmoze   non c edst bon paul merci :)
04:42 paul     yep
04:42 paul     ye
04:41 osmoze   paul>  ?
04:40 osmoze   hdl>  ?
03:29 pierrick un gros client d'INEO
03:28 paul     "chez le fameux client" ?
03:28 pierrick non, pas un journée, je te rassure, de toute façon, je vais chez le fameux client dans une heure
03:25 paul     mais ne passe pas 1 journée là dessus ;-)
03:25 paul     bref, je ne suis pas trop sûr.
03:25 paul     les mots vides sont à mettre dans un fichier à part. Peut être que l'on génèrera ce fichier à partir de la table stopwords.
03:24 paul     quoi que finalement :
03:24 paul     qui utilise d'autres mécanismes.
03:24 paul     vu que la recherche srea gérée par zebra !
03:24 paul     ne perd pas de temps là dessus. Les mots vides seront inutiles dans Koha 3.0 probablement
03:24 paul     quoi que : STOP
03:24 paul     OK.
03:24 pierrick en partant du principe que les stopwords sont courts et qu'on peut faire peut d'opération dessus. Tu me diras ce que tu en penses. (initialement c'était juste pour ajouter la barre de pagination)
03:23 pierrick OK, je suis en train de reprendre la page de gestion des stopwords
03:22 paul     ca doit faire une 50aine de mots
03:22 paul     (dans la plupart des cas, j'ai ce qui est mis par défaut à l'install.
03:22 paul     plusieurs.
03:22 paul     ca dépend de la base.
03:17 pierrick paul, tu as combien de stopwords dans ta base de dev HEAD?
03:17 paul     mais j'y pense jamais !
03:17 paul     pierrick: ah, c'est vrai, KSirc aussi
03:16 pierrick s/chanel/channel/
03:16 pierrick parce que irssi ou Xchat font l'autocomplétion des nicks présent sur le chanel.
03:16 pierrick tu utilises quoi comme client IRC ?
03:16 paul     (et ton login apparait chez moi en vert pale, du coup le _ est difficile à voir. donc faut que je fasse attention si je veux te "bipper"
03:15 paul     je sais je sais. Ca m'amuse juste ;-)
03:15 pierrick désolé, problemes de connexion internet chez INEO
03:14 paul     bonne question...
03:14 paul     a t'il l'utilité d'un _ ou pas ?
03:14 paul     pierrick hésite beaucoup ce matin encore...
02:42 osmoze   (en fait, j ai fais l install avec phpmyadmin et au final pb de perl :()
02:41 osmoze   j avais commencer mais pas terminé...Tiens, tu me choppes au reveille, j ai rien prevu ce matin, j attaque :)
02:41 paul     bonjour d'abord, bonne remarque !
02:41 osmoze   bonjour
02:41 paul     est-ce que tu as eu le temps de tester l'install windows ?
02:40 paul     osmoze ?
02:40 paul     plein de français ce matin encore...
02:26 pierrick hi hdl & paul
02:25 hdl      hi pierrick
22:51 chris    now in koha?
22:51 russ     cos the other channel is too busy
16:47 owen     chris, you around?
16:46 owen     Charming
16:46 slef     Jet2 - I can't believe it's an airline.
16:46 slef     http://www.jet2.com/News.aspx?id=65 "Jet2.com [...] calls for lazy frogs to get back to work!"