Dear Wiki user,

You have subscribed to a wiki page or wiki category on "James Wiki" for change 
notification.

The "GSOC2011" page has been changed by EricCharles.
http://wiki.apache.org/james/GSOC2011?action=diff&rev1=1&rev2=2

--------------------------------------------------

  = James at Google Summer of Code 2001 =
+ James (Java Application Mail Server) is a set of mail-related libraries 
bundled in a server (http://james.apache.org). It supports standard mails 
protocols (smtp, pop3, imap4) and can store the mails in different technologies 
(maildir, database, jcr). We are looking for Students to help add more 
distributed storages (hadoop and nosql). We are also looking for Students to 
bring end-user with more functionality such as mails filter/categorization and 
"out-of-office".
- 
- James (Java Application Mail Server) is a set of mail-related libraries 
bundled in a server (http://james.apache.org).
- It supports standard mails protocols (smtp, pop3, imap4) and can store the 
mails in different technologies (maildir, database, jcr).
- We are looking for Students to help add more distributed storages (hadoop and 
nosql).
- We are also looking for Students to bring end-user with more functionality 
such as mails filter/categorization and "out-of-office".
  
  == Design and implement a distributed mailbox using Hadoop ==
+ The mailbox subproject (http://james.apache.org/mailbox/) supports maildir, 
SQL database (via JPA) and Java Content Repository (JCR) as technology for mail 
storage. This flexibility is achieved thanks to a api design that allows to 
abstract the mail storage from the protocols, allowing to implement We need to 
better support distributed storage such as Hadoop HDFS. The James mailbox API 
will be used. A first step will be to design how to interact with Hadoop 
(native api, gora incubator at apache,...). The second step will be to 
implement the HDFS mailbox (maildir mailbox is similar and can be an 
inspiration). A single james server will still be deployed because we don't 
have any distributed UID generation.
- The mailbox subproject (http://james.apache.org/mailbox/) supports maildir, 
SQL database (via JPA) and Java Content Repository (JCR) as technology for mail 
storage.
- This flexibility is achieved thanks to a api design that allows to abstract 
the mail storage from the protocols, allowing to implement
- We need to better support distributed storage such as Hadoop HDFS.
- The James mailbox API will be used. A first step will be to design how to 
interact with Hadoop (native api, gora incubator at apache,...). The second 
step will be to implement the HDFS mailbox (maildir mailbox is similar and can 
be an inspiration).
- A single james server will still be deployed because we don't have any 
distributed UID generation.
  
  == Distributed UID generation ==
- IMAP4 need to generate incremental UID. This is now achieved in James IMAP 
subproject (http://james.apache.org/imap) with a UidProvider interface which is 
implemented in memory, but which does not allow distributed working of the 
solution.
+ IMAP4 need to generate incremental UID. This is now achieved in James IMAP 
subproject (http://james.apache.org/imap) with a UidProvider interface which is 
implemented in memory, but which does not allow distributed working of the 
solution. A DistributedUidProvider must be designed (based on distributed 
memory cache such as hazelcast for example, or any other solution), and 
implemented.
- A DistributedUidProvider must be designed (based on distributed memory cache 
such as hazelcast for example, or any other solution), and implemented.
  
- == Design and implement machine learning filters and categorization for mail==
+ == Design and implement machine learning filters and categorization for mail 
==
+ Anti-spam functionality based on SpamAssassin is available at James (base on 
mailets http://james.apache.org/mailet). Bayesian mailets are also available, 
but not completely integrated/documented. We are willing to align the existing 
implementation with any modern anti-spam solution based on powerfull machine 
learning implementation (such as apache mahout). We are also willing to extend 
the machine learning usage to some mail categorization (spam vs not-spam is a 
first category, we can extend it to any additional category we can imagine).
- Anti-spam functionality based on SpamAssassin is available at James (base on 
mailets http://james.apache.org/mailet).
- Bayesian mailets are also available, but not completely integrated/documented.
- We are willing to align the existing implementation with any modern anti-spam 
solution based on powerfull machine learning implementation (such as apache 
mahout).
- We are also willing to extend the machine learning usage to some mail 
categorization (spam vs not-spam is a first category, we can extend it to any 
additional category we can imagine).
  
  == Implement additional SIEVE RFCs ==
  [fill in description]
@@ -31, +19 @@

  [fill in description]
  
  == Add "out-of-office" functionality ==
+ A frequently asked function is to have the ability to set per user a 
"out-of-office". In that case, the sender will automatically receive a default 
mail saying the recipient is not there. The API and implementation must be 
defined (based on a mailet or not). The way the end-user will set/unset his 
"out-of-office" as the message that will be send must also be imagined (via 
hupa webmail for example).
- A frequently asked function is to have the ability to set per user a 
"out-of-office". In that case, the sender will automatically receive a default 
mail saying the recipient is not there.
- The API and implementation must be defined (based on a mailet or not).
- The way the end-user will set/unset his "out-of-office" as the message that 
will be send must also be imagined (via hupa webmail for example).
  

Reply via email to