Interesting, Let's throw in a few numbers. Imagine that you have a large queue, 
storefile 100 MB. Now imagine that every 30" you get 100 MB I/O to maintain 
that. First of all, if you crash you may loose up to the last 30" of messsages. 
On the other hand, if in those 30" you send ~20,000 sms x 2250 B/msg = 43 MB, 
you even get less I/O with the spool directory! Using these numbers you can do 
your own calculations for your own system.

The whole advantage to the spool is that the FS will provide indexing to the 
messages whereas it is not supported in the store-file. Like a database table 
without indeces. It is true that kannel doesn't need to match specific 
messages, just propably pulls from the top of the store. The I/O is more evenly 
spread over the period, your  spool files are up to the second, and on top of 
that you can manage it better. Just a bit of care to use a dedicated, 
appropriate filesystem.

Thanks a lot Alej,
Nikos
  ----- Original Message ----- 
  From: Alejandro Guerrieri 
  To: Tony Kirkham 
  Cc: Nikos Balkanas ; [email protected] 
  Sent: Thursday, December 04, 2008 6:18 PM
  Subject: Re: Removing an erroring message stuck in queue / file store


  Tony,

  Just a quick update on store file internals. I've been checking the code a 
little and the dumping of messages doesn't occur on each message change, but on 
a separate thread at given time intervals (something like every 30 seconds I 
think).

  That being said, I personally prefer the spool folder over the store file. 
It's more like a "Maildir over Mailbox" approach. The spool has a little more 
I/O stress (each message is updated/deleted at the moment of the change, 
compared to dumping the whole file at given intervals) but on the other side 
there's a lot less of locking going on.

  The only issue to keep in mind is the performance on journaling filesystems, 
but that only shows up on heavily loaded scenarios (maybe handling hundreds of 
thousands messages a day or more) so if you're not dealing with high traffic it 
won't be a problem at all. If you're planning your setup you can consider 
having a dedicated ext2 or xfs partition for this anyway.

  The spool folder was introduced time ago (more than a year I think) but it 
was after 1.4.1 was out, so you need a CVS version to take advantage of this. 
Anyway, I think a new stable release it's not far away.

  CVS code is as stable as 1.4.1 imho, I've been using it on production systems 
for years without a single problem.

  Regards,

  Alejandro Guerrieri


  On Thu, Dec 4, 2008 at 1:39 PM, Tony Kirkham <[EMAIL PROTECTED]> wrote:

    Thanks, guys.  This is a lot of great information.  I will be trying this 
and I will post my results.  

    One other question though, are there any advantages to the file store over 
the spool store?  Before this conversation, I had not understood what the spool 
store was.  I had only heard the term mentioned in passing.  I have been using 
older documentation (Kannel 1.4.1 User's Guide) that lists a parameter to the 
core configuration of "store-file" and that is what is currently in my config 
file and running on my system.  The actual Kannel code I am using is a CVS pull 
of Kannel from June of this year.  Now that I examine the latest, sort of, 
documentation (cvs-20081125) I no longer see the "store-file" parameter.  
Instead I see "store-type" and "store-location".

    I am assuming, now, that the spool storage type is a "recent" addition to 
Kannel and was added because it worked more effeciently than trying to rewrite 
a single file containing many messages.  Is this assumption correct?  

    Now that I know about the spool storage type I think I will be switching to 
it.  I can't currently see any advantage the file store has over the spool type 
and I see many disadvantages, the impetus of this entire conversation, to name 
one.  I am now wondering if my June Kannel code has these parameters in it or 
if I will have to get a more recent code base and recompile.  Any thoughts?

    Thanks again,

    -Tony




    2008/12/3 Nikos Balkanas <[EMAIL PROTECTED]>

      Definitely. I have a test kannel, hooked to fakesmsc, and I would always 
try out a copy of the "fixed" store file before commiting it. There is no 
header from what I can see, however there is a footer that has a list of all 
message ids. This is a bit more complex, not much though. I presumed that there 
was no CRC or indexing. It seems that the message has to be removed both from 
the body and the footer.

      Tony if you are watching this, a simple cat at the end won't do it, but 
something similar should.

      The footer has to be removed and saved in a different file. Then do the 
cat and insert the saved footer just before the new footer, so that the right 
order is preserved.

      Of course it goes without saying that the procedure has to be tested 
first on a test environment with fakesmsc. I could do it, except that i don't 
have any real messages.

      BR,
      Nikos
      ----- Original Message ----- 
        From: Alejandro Guerrieri 
        To: Nikos Balkanas 
        Cc: Tony Kirkham ; [email protected] 
        Sent: Thursday, December 04, 2008 4:20 AM
        Subject: Re: Removing an erroring message stuck in queue / file store


        Hmm, I'm not 100% sure you can append messages at the end of the store 
and get away with it... I'd first try it on a staging server.

        You can surely "merge" new messages on a spool by copying (you need to 
restart kannel for it to notice the new messages, though) but I don't know if 
is there any "header" or "footer" information on the store files that would 
break the format if appended together.

        I'm not saying it's not possible, just that I've never tried it and 
since I'm not _that_ fluent on the file store internals I cannot tell for sure.

        Regards,

        Alejandro


        2008/12/3 Nikos Balkanas <[EMAIL PROTECTED]>

          BTW, Tony. It doesn't seem that you will have significant downtime. 
If Alej is right you can just stop kannel momentarily to grab the store files 
and start it again. Once the problematic store files are fixed you can just 
stop kannel momentarily and "cat kannel.store.fixed >> kannel.store". Do the 
same for kannel.store.back and restart kannel. Just don't change from storefile 
to spool during the process.

          So you don't loose any messages and downtime can be minimal. Cool 
right?
            ----- Original Message ----- 
            From: Nikos Balkanas 
            To: Alejandro Guerrieri ; Tony Kirkham 
            Cc: [email protected] 
            Sent: Thursday, December 04, 2008 2:35 AM
            Subject: Re: Removing an erroring message stuck in queue / file 
store


            Thanks for pointing that out. Still it is easy to preview in "vim 
-b" (not hex) and can identify *any* of the fields Sender, recipient, text, 
messageID, so if anyone is missing there are a lot of matches in ASCII. The 
binary text is easy and straightforward. One can reasonably find the borders 
(patterns) of the previous and next message (presumably intact) and delete the 
in between.

            Tony, if the file is small enough to send through mail (zipped) and 
don't feel like trying it yourself, why don't you send it to me along with the 
message details to see what I can do about it.

            Alej from your words it seems that a spool directory is more 
efficient than a storefile. If anytime kannel sends a message has to delete and 
rewrite almost the whole store file, it is a big waste. Isn't so?

            BR,
            Nikos
              ----- Original Message ----- 
              From: Alejandro Guerrieri 
              To: Nikos Balkanas 
              Cc: Tony Kirkham ; [email protected] 
              Sent: Thursday, December 04, 2008 2:18 AM
              Subject: Re: Removing an erroring message stuck in queue / file 
store


              The 20 nulls are probably a lot of empty/unused fields, that 
doesn't necessary means that they'll be always empty... so don't assume that 
there should be that much. It's not a separator, just a pattern in your 
particular message queue set.

              When you edit, you have to make sure that you're actually cutting 
on the right places. Alas, if the message is in fact corrupted maybe it's not 
complete, so YMMV.

              Using the spool file is way easier, of course. Again, if you have 
a "broken" message (why is that it's another question) it's just a matter of 
determining which one is causing the problem and removing it.

              Regards,

              Alejandro


              2008/12/3 Nikos Balkanas <[EMAIL PROTECTED]>

                Hi,

                Just out of curiosity I openned a store file and it doesn't 
seem too difficult. You can edit it easily with "vim -b" and there are a lot of 
ASCII fields (sender, recipient, messageid), so you can search for your message 
easily. Each message seems to be bordered by 20 '\0' (NULLS). Can't miss them. 
Find offending message and delete by hand (assuming that you know which message 
is the offending one).

                This again assumes that there is no CRC or indexing of the 
messages as discussed by Alejandro. In other words, you can safely remove any 
message.

                Make a backup, try it and let us know of the result. Stop 
kannel, and repeat for both files.

                Cheers,
                Nikos
                  ----- Original Message ----- 
                  From: Alejandro Guerrieri 
                  To: Tony Kirkham 
                  Cc: [email protected] 
                  Sent: Wednesday, December 03, 2008 7:14 PM
                  Subject: Re: Removing an erroring message stuck in queue / 
file store


                  Ah, you're using the "file" store, not "spool" maybe?

                  AFAIK, there's no document on the storage format. It's mainly 
a dump from the "Msg" structure, but the "Octstr" objects can have nulls in the 
middle, so it's not as easy to parse and, it doesn't have a fixed length and 
(if you're still willing to venture at it) you'll need an hex editor if you 
don't want to mess the whole file (the nulls in the middle of the file will 
probably render any text editor unusable).

                  If you use the "spool" store, each message is stored on an 
independent file on a directory tree. Removing the file removes the message.

                  The only problem is that kannel only uses the files as a 
backup mechanism and it doesn't refresh it during runtime. To clarify:

                  * When a message arrives (or is enqueued to be sent), kannel 
adds it to an in-memory queue and stores a copy on disk. An internal uuid is 
used to keep track of messages on both places.

                  * When kannel ends processing the message, it removes it from 
the in-memory queue and deletes the message from disk. Depending on the store 
type being used, this means removing it from the store file and rewriting the 
file, or remove an entire file from the spool tree.

                  * If kannel crashes or is shutdown with messages pending, the 
messages are kept on the file/spool queue. During a shutdown, the in-memory 
queue is dumped to file again afaik.

                  * When kannel starts, it first load the messages from the 
file/spool queue.

                  * On the particular case of the file store, a backup file is 
kept at all times in case the main file is deleted or corrupted.

                  So, from the above, it's clear why you need to stop kannel 
before deleting the file(s).

                  You can stop kannel, delete the store and store.bak files, or 
remove the whole spool tree and start again. It will remove ALL your pending 
messages. If you can figure out which message is causing the problem, you can 
remove it from the spool (if you're using the spool store) as well. With the 
file store, you'd need to figure out from the binary, which is near to 
impossible without some specially crafted tools imho.

                  Regards,

                  Alejandro


                  On Wed, Dec 3, 2008 at 1:15 PM, Tony Kirkham <[EMAIL 
PROTECTED]> wrote:

                    How?  By hand, in my favorite editor?  I have tried this, 
with bad results.  There are special characters, at least vi displays them that 
way, that I can't interpret so I do not know where, exactly, one message stops 
and the next starts.  Is there a document that describes the store file in 
detail?

                    -Tony 



                    On Tue, Dec 2, 2008 at 4:08 PM, Alejandro Guerrieri <[EMAIL 
PROTECTED]> wrote:

                      And do it without kannel running, of course ;)

                      If you're using the spool store file, you can remove the 
individual message (assuming that you can find out what's the message causing 
the error).

                      Regards,

                      Alejandro 



                      On Tue, Dec 2, 2008 at 5:49 PM, info.ubichip <[EMAIL 
PROTECTED]> wrote:

                        Hello,

                        when you trash the .store file, don't forget to remove 
the .store.bak as well otherwise, it will retry to send all the remaining 
message.

                        regards




--------------------------------------------------------

                        From: Tony Kirkham [mailto:[EMAIL PROTECTED] 

                        Sent: lundi 1 dιcembre 2008 21:32 

                        To: [email protected]
                        Subject: Removing an erroring message stuck in queue / 
file store



                        Is there a way to remove a message from the file store 
that is erroring continuously allowing the other messages to be sent?

                        If I simply restart Kannel sometimes the other messages 
get sent and sometimes they do not.  Why is that?
                        If they all get sent but the erroring message I can 
simply replace the file store with another file with a restart of Kannel, but I 
would like to know if there is a way to remove the message by some other means.

                        Thanks,

                        -Tony














Reply via email to