[
https://issues.apache.org/jira/browse/MAILBOX-170?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13219979#comment-13219979
]
Eric Charles commented on MAILBOX-170:
--------------------------------------
Hi Ioan, imho storing the raw mail in an HDFS SequenceFile can be an
option.
We will need to measure the efficiency of this implementation compared to a
pure HBase one (you know the story: "... hdfs is for very very large files...").
Besides a distributed mailbox locker (JAMES-1388), we also need a mechanism to
query the mailbox efficiently (for IMAP search queries, for example, which the
current hbase impl does not cover either).
btw, please ensure the existing mailbox-hbase remains as it is (without hadoop),
and start the implementation in a mailbox-hadoop project.
> Store mailboxes in HDFS SequenceFile
> ------------------------------------
>
> Key: MAILBOX-170
> URL: https://issues.apache.org/jira/browse/MAILBOX-170
> Project: James Mailbox
> Issue Type: Improvement
> Components: hbase
> Affects Versions: 0.4
> Reporter: Ioan Eugen Stan
> Assignee: Ioan Eugen Stan
> Fix For: 0.5
>
>
> The current implementation stores messages directly in HBase. I believe a
> better approach is to store the messages in SequenceFiles as <mail_ID>:
> <message_data> records. HBase will store the offsets into each mailbox's
> SequenceFile for fast access, similar to a Hadoop MapFile.
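
To make the proposed layout concrete, here is a minimal sketch of the idea
in plain Java. It is only an illustration under stated assumptions: a local
file stands in for the per-mailbox SequenceFile, an in-memory map stands in
for the HBase table holding the offsets, and the class name, record layout
and method names are all hypothetical. It does not use the Hadoop or HBase
APIs and is not the implementation discussed in this issue.

```java
import java.io.IOException;
import java.io.RandomAccessFile;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Path;
import java.util.HashMap;
import java.util.Map;

/** Plain-Java stand-in for the proposed layout; NOT the Hadoop or HBase API. */
public class MailboxOffsetIndexSketch {

    // Assumption: stands in for the HBase table (mail ID -> byte offset of the record).
    private final Map<String, Long> offsets = new HashMap<>();

    // Assumption: stands in for the per-mailbox SequenceFile on HDFS.
    private final Path file;

    public MailboxOffsetIndexSketch(Path file) {
        this.file = file;
    }

    /** Appends one <mail_ID>: <message_data> record and remembers where it starts. */
    public void append(String mailId, byte[] messageData) throws IOException {
        try (RandomAccessFile raf = new RandomAccessFile(file.toFile(), "rw")) {
            long offset = raf.length();
            raf.seek(offset);
            byte[] key = mailId.getBytes(StandardCharsets.UTF_8);
            // Hypothetical record layout: key length, key, data length, data.
            raf.writeInt(key.length);
            raf.write(key);
            raf.writeInt(messageData.length);
            raf.write(messageData);
            offsets.put(mailId, offset);
        }
    }

    /** Seeks straight to the stored offset instead of scanning; null if the ID is unknown. */
    public byte[] read(String mailId) throws IOException {
        Long offset = offsets.get(mailId);
        if (offset == null) {
            return null;
        }
        try (RandomAccessFile raf = new RandomAccessFile(file.toFile(), "r")) {
            raf.seek(offset);
            int keyLength = raf.readInt();
            raf.skipBytes(keyLength);
            byte[] data = new byte[raf.readInt()];
            raf.readFully(data);
            return data;
        }
    }

    public static void main(String[] args) throws IOException {
        Path tmp = Files.createTempFile("mailbox", ".seq");
        try {
            MailboxOffsetIndexSketch mailbox = new MailboxOffsetIndexSketch(tmp);
            mailbox.append("msg-1", "Subject: hello\r\n\r\nfirst".getBytes(StandardCharsets.UTF_8));
            mailbox.append("msg-2", "Subject: again\r\n\r\nsecond".getBytes(StandardCharsets.UTF_8));
            // Random access to the second message without reading the first one.
            System.out.println(new String(mailbox.read("msg-2"), StandardCharsets.UTF_8));
        } finally {
            Files.deleteIfExists(tmp);
        }
    }
}
```

The point of the sketch is the split of responsibilities: the large message
bodies are appended sequentially to one container file, while the small
offset entries live in a separate lookup structure, so reading one message
is a single seek rather than a scan.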