Well, I stand corrected, and do think he has a point. OCFS2 is heavilly 
tested on Oracle RAC related scenarios, large files, high I/O rates, etc. On 
other scenarios, dealing with ACLs, and multiple user ids, it might not be so 
throrougly tested. But I do see a large effort to resolve any issues that are 
reported.

     Also there is the point that it is designed for Oracle database use. So 
the fencing and reboots are intentional, and they happen fast. All nodes in 
Oracle RAC freeze when one of the nodes is unreachable, either by SAN or 
network failures, this is inevitable due to the architecture of the database 
itself, so it is highly desirable that the misbehaving node go down fast, so 
that the other nodes can start crash recovery of the failed node, during which 
the database is still hung, since the crashed node memory structures are lost 
and need to be recovered from the redo and rollback segments. 

    One thing that I think could be improved is that there is no communication 
between the O2CB and CRS, so if the timeouts are not setup correctly, one can 
have a situation where O2CB resolves to fence one node, and CRS fences the 
other. This seems to not happen with vendor clusterware, that always have to be 
integrated with CRS to be supported with Oracle RAC.

    Other thing that would be interesting, if technically feasible, would be to 
have a option for OCFS2 behave more like NFS, and instead of rebooting, fence 
all I/O to the filesystems on the evicted node, waiting to see if it can rejoin 
the cluster. When this situation occurs the fenced node would need to consider 
all the blocks on the buffer cache as invalid, so that other nodes could modify 
them safely. Not sure how filesystem locks could be handled.

Regards,
Luis

Sunil Mushran <[EMAIL PROTECTED]> wrote: Randy Ramsdell wrote:
> I am not taking sides but I think Alexei's postings are a positive 
> contribution to this project and more of a contribution than the 
> lurkers who write nothing to the list. His feedback does have merit 
> and should be considered valuable although it is critical of ocfs2. 
> We, although I strongly disagreed, have stopped using ocfs2 and we 
> WERE using it in a production environment.
>
The so-called lurkers file bugs, validate patches. That helps everybody.

His feedback is useless. Ranting about the core design does not help
anyone. Dreaming up possible bugs, without actually looking for them,
is useless.

When you encounter a bug, file a bugzilla with as much relevant
information as possible. If you do that, we can fix the bug and everyone
benefits. Sending emails like a broken record is not a positive 
contribution.

BTW, I hear great things about Vista.

_______________________________________________
Ocfs2-users mailing list
[email protected]
http://oss.oracle.com/mailman/listinfo/ocfs2-users


       
---------------------------------
Need a vacation? Get great deals to amazing places on Yahoo! Travel. 
_______________________________________________
Ocfs2-users mailing list
[email protected]
http://oss.oracle.com/mailman/listinfo/ocfs2-users

Reply via email to