March 2015 Report

2015-02-26 Thread Marvin Humphrey

Greetings, {podling} developers,

The marvin automated report reminder script didn't fire for whatever
reason this week, so I'm sending out a bulk reminder manually for the 18
podlings who are expected to submit a March 2015 report.  Boilerplate
repinder text below.

Best,

Marvin Humphrey

---

Dear podling,

This email was sent by an automated system on behalf of the Apache
Incubator PMC. It is an initial reminder to give you plenty of time to
prepare your quarterly board report.

The board meeting is scheduled for Wed, 18 March 2015, 10:30 am PST.
The report for your podling will form a part of the Incubator PMC
report. The Incubator PMC requires your report to be submitted 2 weeks
before the board meeting, to allow sufficient time for review and
submission (Wed, March 4th).

Please submit your report with sufficient time to allow the incubator
PMC, and subsequently board members to review and digest. Again, the
very latest you should submit your report is 2 weeks prior to the board
meeting.

Thanks,

The Apache Incubator PMC

Submitting your Report

--

Your report should contain the following:

*   Your project name
*   A brief description of your project, which assumes no knowledge of
the project or necessarily of its field
*   A list of the three most important issues to address in the move
towards graduation.
*   Any issues that the Incubator PMC or ASF Board might wish/need to be
aware of
*   How has the community developed since the last report
*   How has the project developed since the last report.

This should be appended to the Incubator Wiki page at:

http://wiki.apache.org/incubator/March2015

Note: This is manually populated. You may need to wait a little before
this page is created from a template.

Mentors
---

Mentors should review reports for their project(s) and sign them off on
the Incubator wiki page. Signing off reports shows that you are
following the project - projects that are not signed may raise alarms
for the Incubator PMC.

Incubator PMC 



march incubator report

2015-02-26 Thread Billie Rinaldi
Looks like HTrace is scheduled to report in March, but I don't think we got
a notification.  Due date is Wednesday, March 4th:
https://wiki.apache.org/incubator/March2015


[REPORT] HTrace March 2015

2015-02-26 Thread Lewis John Mcgibbney
Hi Folks,
Please see below, this has been added to the wiki

HTraceHTrace is a tracing framework intended for use with distributed
systemswritten in java.HTrace has been incubating since 2014-11.Three
most important issues to address in the move towards graduation:  1.
Continue to grow the HTrace community  2. Continue to develop and
release stable HTrace incubating artifacts  3. Continue to explore the
integration of the HTrace framework into other Apache productsAny
issues that the Incubator PMC (IPMC) or ASF Board wish/need to beaware
of?NoHow has the community developed since the last report?There has
been a bunch of mailing list activity relating directlyto issue 3
above e.g. better integration of HTrace into HBase/HDFS.HTrace is
being represented at ApacheCon 2015 NA in April with a
presentationIntroducing Apache HTrace: An End-to-End Tracing
Framework for Distributed Systems - Colin McCabe, Cloudera -
http://sched.co/2P8QHow has the project developed since the last
report?The codebase has seen about 30 odd commits since last
reporting.Jira continues to see activity which is encouraging as
HTrace communityprogresses towards next incubating release.Date of
last release:   2015-20-01 htrace-3.1.0-incubatingWhen were the last
committers or PMC members elected?Abraham Elmahrek was elected to
become an HTrace committer onWed, 11 Feb, 2-15.Signed-off-by:   [
](htrace) Jake Farrell  [ ](htrace) Todd Lipcon  [X](htrace) Lewis
John Mcgibbney  [ ](htrace) Andrew Purtell  [ ](htrace) Billie Rinaldi
 [ ](htrace) Michael StackShepherd/Mentor notes:


Ta
Lewis


-- 
*Lewis*


Re: [REPORT] HTrace March 2015

2015-02-26 Thread Elliott Clark
Looks good.
Thanks
+1

On Thu, Feb 26, 2015 at 12:41 PM, Stack st...@duboce.net wrote:

 Thanks Lewis. +1
 St.Ack

 On Thu, Feb 26, 2015 at 10:58 AM, Lewis John Mcgibbney 
 lewis.mcgibb...@gmail.com wrote:

  Hi Folks,
  Please see below, this has been added to the wiki
 
  HTraceHTrace is a tracing framework intended for use with distributed
  systemswritten in java.HTrace has been incubating since 2014-11.Three
  most important issues to address in the move towards graduation:  1.
  Continue to grow the HTrace community  2. Continue to develop and
  release stable HTrace incubating artifacts  3. Continue to explore the
  integration of the HTrace framework into other Apache productsAny
  issues that the Incubator PMC (IPMC) or ASF Board wish/need to beaware
  of?NoHow has the community developed since the last report?There has
  been a bunch of mailing list activity relating directlyto issue 3
  above e.g. better integration of HTrace into HBase/HDFS.HTrace is
  being represented at ApacheCon 2015 NA in April with a
  presentationIntroducing Apache HTrace: An End-to-End Tracing
  Framework for Distributed Systems - Colin McCabe, Cloudera -
  http://sched.co/2P8QHow has the project developed since the last
  report?The codebase has seen about 30 odd commits since last
  reporting.Jira continues to see activity which is encouraging as
  HTrace communityprogresses towards next incubating release.Date of
  last release:   2015-20-01 htrace-3.1.0-incubatingWhen were the last
  committers or PMC members elected?Abraham Elmahrek was elected to
  become an HTrace committer onWed, 11 Feb, 2-15.Signed-off-by:   [
  ](htrace) Jake Farrell  [ ](htrace) Todd Lipcon  [X](htrace) Lewis
  John Mcgibbney  [ ](htrace) Andrew Purtell  [ ](htrace) Billie Rinaldi
   [ ](htrace) Michael StackShepherd/Mentor notes:
 
 
  Ta
  Lewis
 
 
  --
  *Lewis*
 



Re: HTrace for Nutch 2.x Search Stack

2015-02-26 Thread Nick Dimiduk
Hi Lewis,

The results of TRACE level logging will give you messages from the various
system components according to whatever those developers thought relevant
for exposing from the application. Assembling the HTrace spans from a
single trace will give you something similar, but structured according to
the major boundaries within the system. The original intention is for
tracing a single request through multiple RPC's across multiple systems. At
the very least, you'd see these boundary points and the amount of time
spent in each component. Each component's developers are free to use the
HTrace API to attach annotations to their trace spans. These annotations
are arbitrary messages, accompanied by timestamps, included by the whims of
the projects' developers.

I guess the big difference is that trace spans collected by HTrace are a
bit more structured than simple log messages. There's a parent-child
relationship between spans, and this hierarchy exists across threads within
a process and across processes within a distributed application.

Maybe someone else can do a better job explaining...

-n

On Thu, Feb 26, 2015 at 12:57 PM, Lewis John Mcgibbney 
lewis.mcgibb...@gmail.com wrote:

 Hi Folks,
 Right now we are able to initiate TRACE within Log4j file in Nutch 2.X [0].
 Nutch 2.X offer storage abstraction for storage of Webpage and Host data
 through use of Apache Gora.
 Gora backend support includes

- Apache Avro 1.7.6
- Apache Hadoop 1.2.1 and 2.5.2
- Apache HBase 0.98.8-hadoop2
- Apache Cassandra 2.0.2
- Apache Solr 4.10.3
- MongoDB 2.6.X
- Apache Accumlo 1.5.1

 Parts of the Nutch 2.X search stack include Nutch, Gora, optionally Solr
 and/or Elastic Search for indexing context and optionally Hadoop for
 running it all on.

 As a developer of Nutch and Gora am very keen to see a more verbose tracing
 mechanism (potentially with some visualization) for debugging our search
 stack deployments. Can I please kick off conversation on what the
 difference and benefits of H Trace are over log4j TRACE level logging.

 Once I understand the above, I would like to obtain advice on the
 integration points for having HTrace as the tracing mechanism for the stack
 I describe above.
 Thanks
 Lewis

 [0] https://github.com/apache/nutch/blob/2.x/conf/log4j.properties
 --
 *Lewis*



Re: [REPORT] HTrace March 2015

2015-02-26 Thread Nick Dimiduk
+1 by me.

On Thu, Feb 26, 2015 at 10:58 AM, Lewis John Mcgibbney 
lewis.mcgibb...@gmail.com wrote:

 Hi Folks,
 Please see below, this has been added to the wiki

 HTraceHTrace is a tracing framework intended for use with distributed
 systemswritten in java.HTrace has been incubating since 2014-11.Three
 most important issues to address in the move towards graduation:  1.
 Continue to grow the HTrace community  2. Continue to develop and
 release stable HTrace incubating artifacts  3. Continue to explore the
 integration of the HTrace framework into other Apache productsAny
 issues that the Incubator PMC (IPMC) or ASF Board wish/need to beaware
 of?NoHow has the community developed since the last report?There has
 been a bunch of mailing list activity relating directlyto issue 3
 above e.g. better integration of HTrace into HBase/HDFS.HTrace is
 being represented at ApacheCon 2015 NA in April with a
 presentationIntroducing Apache HTrace: An End-to-End Tracing
 Framework for Distributed Systems - Colin McCabe, Cloudera -
 http://sched.co/2P8QHow has the project developed since the last
 report?The codebase has seen about 30 odd commits since last
 reporting.Jira continues to see activity which is encouraging as
 HTrace communityprogresses towards next incubating release.Date of
 last release:   2015-20-01 htrace-3.1.0-incubatingWhen were the last
 committers or PMC members elected?Abraham Elmahrek was elected to
 become an HTrace committer onWed, 11 Feb, 2-15.Signed-off-by:   [
 ](htrace) Jake Farrell  [ ](htrace) Todd Lipcon  [X](htrace) Lewis
 John Mcgibbney  [ ](htrace) Andrew Purtell  [ ](htrace) Billie Rinaldi
  [ ](htrace) Michael StackShepherd/Mentor notes:


 Ta
 Lewis


 --
 *Lewis*



Re: Trace HBase/HDFS with HTrace

2015-02-26 Thread Masatake Iwasaki

 perhaps a double detach.

It turned out to be double detach of NullScope singleton and I filed 
HTRACE-119.
I got the error on meta assignments on regionserver startup without 
starting trace spans..


Thanks,
Masatake

On 2/26/15 09:58, Colin P. McCabe wrote:

Hmm.  Looking at that error, my guess would be that there is an
incorrect usage of TraceScope#detach going on somewhere in hbase...
perhaps a double detach.  But I could be wrong.  We added some code
recently to catch issues like this.

best,
Colin

On Wed, Feb 25, 2015 at 12:28 AM, Masatake Iwasaki
iwasak...@oss.nttdata.co.jp wrote:

I tried hbase-1 built against today's htrace-3.2.0-SNAPSHOT (with quick fix
to TestHTraceHooks).
I got the error below in regionserver log.
I will dig this tomorrow.::

   2015-02-25 00:18:29,270 ERROR [RS_OPEN_META-centos7:16201-0]
htrace.Tracer: Tried to detach trace span null but it has already been
detached.
   2015-02-25 00:18:29,271 ERROR [RS_OPEN_META-centos7:16201-0]
handler.OpenRegionHandler: Failed open of region=hbase:meta,,1.1588230740,
starting to roll back the global memstore size.
   java.lang.RuntimeException: Tried to detach trace span null but it has
already been detached.
   at org.apache.htrace.Tracer.clientError(Tracer.java:61)
   at org.apache.htrace.TraceScope.detach(TraceScope.java:57)
   at
org.apache.hadoop.hbase.regionserver.wal.FSHLog.sync(FSHLog.java:1559)
   at
org.apache.hadoop.hbase.regionserver.wal.WALUtil.writeRegionEventMarker(WALUtil.java:94)
   at
org.apache.hadoop.hbase.regionserver.HRegion.writeRegionOpenMarker(HRegion.java:910)
   at
org.apache.hadoop.hbase.regionserver.HRegion.openHRegion(HRegion.java:4911)
   at
org.apache.hadoop.hbase.regionserver.HRegion.openHRegion(HRegion.java:4874)
   at
org.apache.hadoop.hbase.regionserver.HRegion.openHRegion(HRegion.java:4845)
   at
org.apache.hadoop.hbase.regionserver.HRegion.openHRegion(HRegion.java:4801)
   at
org.apache.hadoop.hbase.regionserver.HRegion.openHRegion(HRegion.java:4752)
   at
org.apache.hadoop.hbase.regionserver.handler.OpenRegionHandler.openRegion(OpenRegionHandler.java:356)
   at
org.apache.hadoop.hbase.regionserver.handler.OpenRegionHandler.process(OpenRegionHandler.java:126)
   at
org.apache.hadoop.hbase.executor.EventHandler.run(EventHandler.java:128)
   at
java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1145)
   at
java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615)
   at java.lang.Thread.run(Thread.java:745)



On 2/24/15 18:27, Colin P. McCabe wrote:

Thanks for trying this, Mastake.  I've got HDFS working on my cluster
with tracing and LocalFileSpanReceiver.  Did you try using HBase +
HDFS with LocalFileSpanReceiver?  Be sure to use a build including
HTRACE-112 since LFSR was kind of busted prior to that.

I'm going to do a longer writeup about getting HDFS + HBase working
with other span receivers just as soon as I finish stomping a few more
bugs.

best,
Colin

On Tue, Feb 24, 2015 at 12:04 PM, Masatake Iwasaki
iwasak...@oss.nttdata.co.jp wrote:

Hi,

Thanks for trying this. I am sorry for late reply.

I tried this today
by hbase-1.0.1-SANPSHOT built with
{{-Dhadoop-two.version=2.7.0-SNAPSHOT}}
in pseudo distributed cluster
but failed to get end-to-end trace.

I checked that
* tracing works for both of hbase and hdfs,
* hbase runs with 2.7.0-SNAPSHOT jar of hadoop.

When I did do put with tracing on,
I saw span named FSHLog.sync with annotations such as
syncing writer and writer synced.
The code for tracing in FSHLog worked at least.

I'm still looking into this.
If it turned out that tracing spans are not reached to
actual HDFS writer thread in HBase, I will file a JIRA.

# We need hadoop-2.6.0 or higher in order to trace HDFS.
# Building hbase from source with {{-Dhadoop-two.version=2.6.0}}
# is straight forward way to do this
# because the binary release of hbase-1.0.0 bundles hadoop-2.5.1 jars.

Masatake


On 2/11/15 08:56, Nick Dimiduk wrote:

Hi Joshua,

In theory there's nothing special for you to do. Just issue your query
to
HBase with tracing enabled. The active span will go through HBase, down
into HDFS, and back again. You'll need both systems collecting spans
into
the same place so that you can report on the complete trace tree.

I've not recently tested the end-to-end, but I believe it's all there.
If
not, it's a bug -- this is an intended use case. Can you give it a try
and let us know how it goes?

FYI, 0.99.x are preview releases of HBase and not for production use.
Just
so you know :)

-n

On Wednesday, February 11, 2015, Chunxu Tang chunxut...@gmail.com
wrote:


Hi all,

Now I’m exploiting HTrace to trace request level data flows in HBase
and
HDFS. I have successfully traced HBase and HDFS by using HTrace,
respectively.

After that, I combine HBase and HDFS together and I want to just send a
PUT/GET request to 

Re: [REPORT] HTrace March 2015

2015-02-26 Thread Colin P. McCabe
+1.

Thanks, Lewis.
C.

On Thu, Feb 26, 2015 at 10:58 AM, Lewis John Mcgibbney
lewis.mcgibb...@gmail.com wrote:
 Hi Folks,
 Please see below, this has been added to the wiki

 HTraceHTrace is a tracing framework intended for use with distributed
 systemswritten in java.HTrace has been incubating since 2014-11.Three
 most important issues to address in the move towards graduation:  1.
 Continue to grow the HTrace community  2. Continue to develop and
 release stable HTrace incubating artifacts  3. Continue to explore the
 integration of the HTrace framework into other Apache productsAny
 issues that the Incubator PMC (IPMC) or ASF Board wish/need to beaware
 of?NoHow has the community developed since the last report?There has
 been a bunch of mailing list activity relating directlyto issue 3
 above e.g. better integration of HTrace into HBase/HDFS.HTrace is
 being represented at ApacheCon 2015 NA in April with a
 presentationIntroducing Apache HTrace: An End-to-End Tracing
 Framework for Distributed Systems - Colin McCabe, Cloudera -
 http://sched.co/2P8QHow has the project developed since the last
 report?The codebase has seen about 30 odd commits since last
 reporting.Jira continues to see activity which is encouraging as
 HTrace communityprogresses towards next incubating release.Date of
 last release:   2015-20-01 htrace-3.1.0-incubatingWhen were the last
 committers or PMC members elected?Abraham Elmahrek was elected to
 become an HTrace committer onWed, 11 Feb, 2-15.Signed-off-by:   [
 ](htrace) Jake Farrell  [ ](htrace) Todd Lipcon  [X](htrace) Lewis
 John Mcgibbney  [ ](htrace) Andrew Purtell  [ ](htrace) Billie Rinaldi
  [ ](htrace) Michael StackShepherd/Mentor notes:


 Ta
 Lewis


 --
 *Lewis*