[jira] [Created] (GIRAPH-1224) Allow job to succeed if input is empty

2019-09-13 Thread Maja Kabiljo (Jira)
Maja Kabiljo created GIRAPH-1224:


 Summary: Allow job to succeed if input is empty
 Key: GIRAPH-1224
 URL: https://issues.apache.org/jira/browse/GIRAPH-1224
 Project: Giraph
  Issue Type: New Feature
Reporter: Maja Kabiljo
Assignee: Maja Kabiljo


If input is empty we always fail, but sometimes when it's part of bigger 
workflow we might want to let job succeed, add an option for that.



--
This message was sent by Atlassian Jira
(v8.3.2#803003)


[jira] [Created] (GIRAPH-1222) Allow output formats to have writing setup and finalization

2019-07-25 Thread Maja Kabiljo (JIRA)
Maja Kabiljo created GIRAPH-1222:


 Summary: Allow output formats to have writing setup and 
finalization
 Key: GIRAPH-1222
 URL: https://issues.apache.org/jira/browse/GIRAPH-1222
 Project: Giraph
  Issue Type: New Feature
Reporter: Maja Kabiljo
Assignee: Maja Kabiljo


Sometimes output formats need custom logic to be executed once per worker, 
before and after writers are being used. Add callbacks to allow for that.



--
This message was sent by Atlassian JIRA
(v7.6.14#76016)


[jira] [Created] (GIRAPH-1215) Make FixedCapacityHeaps work with 0 capacity

2018-12-05 Thread Maja Kabiljo (JIRA)
Maja Kabiljo created GIRAPH-1215:


 Summary: Make FixedCapacityHeaps work with 0 capacity
 Key: GIRAPH-1215
 URL: https://issues.apache.org/jira/browse/GIRAPH-1215
 Project: Giraph
  Issue Type: Bug
Reporter: Maja Kabiljo
Assignee: Maja Kabiljo


Currently FixedCapacityHeaps throw an exception when they are used with 
capacity 0.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Created] (GIRAPH-1213) Fix issues with network requests retries and add more logging

2018-11-29 Thread Maja Kabiljo (JIRA)
Maja Kabiljo created GIRAPH-1213:


 Summary: Fix issues with network requests retries and add more 
logging
 Key: GIRAPH-1213
 URL: https://issues.apache.org/jira/browse/GIRAPH-1213
 Project: Giraph
  Issue Type: Bug
Reporter: Maja Kabiljo
Assignee: Maja Kabiljo


Fixing two bugs:
 * When channel fails, we are currently retrying all requests towards the 
destination machine from the channel, instead of just ones which are happening 
on the concrete channel.
 * In practice, we've noticed BlockingOperationException can get thrown when we 
wait to connect on channel in which case we silently don't send the request we 
are trying to send, so catching this exception and retrying instead.

Also added logging of channel ids to be able to debug issues related to network 
requests not delivering easier.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Created] (GIRAPH-1212) Fix DefaultJobProgressTracker when splitMasterWorker=false

2018-11-26 Thread Maja Kabiljo (JIRA)
Maja Kabiljo created GIRAPH-1212:


 Summary: Fix DefaultJobProgressTracker when splitMasterWorker=false
 Key: GIRAPH-1212
 URL: https://issues.apache.org/jira/browse/GIRAPH-1212
 Project: Giraph
  Issue Type: Bug
Reporter: Maja Kabiljo
Assignee: Maja Kabiljo


DefaultJobProgressTracker assumes we are using numWorkers+1 mappers, fix that



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Created] (GIRAPH-1211) Make retrying to send network requests after timeout optional

2018-11-06 Thread Maja Kabiljo (JIRA)
Maja Kabiljo created GIRAPH-1211:


 Summary: Make retrying to send network requests after timeout 
optional
 Key: GIRAPH-1211
 URL: https://issues.apache.org/jira/browse/GIRAPH-1211
 Project: Giraph
  Issue Type: New Feature
Reporter: Maja Kabiljo
Assignee: Maja Kabiljo


Using counters added in GIRAPH-1205 we were able to confirm that resending 
network requests after timeout almost never succeeds, so add an option to fail 
early instead of keep trying to resend these network requests indefinitely.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Created] (GIRAPH-1200) Add counters for network request resends

2018-09-14 Thread Maja Kabiljo (JIRA)
Maja Kabiljo created GIRAPH-1200:


 Summary: Add counters for network request resends
 Key: GIRAPH-1200
 URL: https://issues.apache.org/jira/browse/GIRAPH-1200
 Project: Giraph
  Issue Type: Improvement
Reporter: Maja Kabiljo
Assignee: Maja Kabiljo


Expose statistics around network requests which we had to resend.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Created] (GIRAPH-1184) Don't access configuration on every message

2018-04-05 Thread Maja Kabiljo (JIRA)
Maja Kabiljo created GIRAPH-1184:


 Summary: Don't access configuration on every message
 Key: GIRAPH-1184
 URL: https://issues.apache.org/jira/browse/GIRAPH-1184
 Project: Giraph
  Issue Type: Improvement
Reporter: Maja Kabiljo
Assignee: Maja Kabiljo


Cache whether we are using message size encoding from configuration to prevent 
accessing conf on every message.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Resolved] (GIRAPH-1170) Add logging for out-of-core

2018-03-23 Thread Maja Kabiljo (JIRA)

 [ 
https://issues.apache.org/jira/browse/GIRAPH-1170?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Maja Kabiljo resolved GIRAPH-1170.
--
Resolution: Fixed
  Assignee: Dionysios Logothetis

> Add logging for out-of-core
> ---
>
> Key: GIRAPH-1170
> URL: https://issues.apache.org/jira/browse/GIRAPH-1170
> Project: Giraph
>  Issue Type: Improvement
>Reporter: Dionysios Logothetis
>Assignee: Dionysios Logothetis
>Priority: Minor
>
> |Adding some debugging information for when reading a partition from disk 
> fails.|
>  
>  
>  
>  



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Resolved] (GIRAPH-1160) Fix memory estimation in MemoryEstimatorOrcal

2018-03-23 Thread Maja Kabiljo (JIRA)

 [ 
https://issues.apache.org/jira/browse/GIRAPH-1160?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Maja Kabiljo resolved GIRAPH-1160.
--
Resolution: Fixed
  Assignee: Dionysios Logothetis

> Fix memory estimation in MemoryEstimatorOrcal
> -
>
> Key: GIRAPH-1160
> URL: https://issues.apache.org/jira/browse/GIRAPH-1160
> Project: Giraph
>  Issue Type: Bug
>Reporter: Dionysios Logothetis
>Assignee: Dionysios Logothetis
>Priority: Major
>
> Method MemoryEstimatorOracle.calculateRegression() exits if the number of 
> valid columns to use for the regression is not the same as the total number 
> of columns. This is wrong, the regression can run on only the valid columns. 
> This causes the memory estimation to be very off.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Resolved] (GIRAPH-1168) Instantiate OutEdges through Factory class

2018-03-23 Thread Maja Kabiljo (JIRA)

 [ 
https://issues.apache.org/jira/browse/GIRAPH-1168?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Maja Kabiljo resolved GIRAPH-1168.
--
Resolution: Fixed
  Assignee: Dionysios Logothetis

> Instantiate OutEdges through Factory class
> --
>
> Key: GIRAPH-1168
> URL: https://issues.apache.org/jira/browse/GIRAPH-1168
> Project: Giraph
>  Issue Type: New Feature
>Reporter: Dionysios Logothetis
>Assignee: Dionysios Logothetis
>Priority: Major
>
> Sometimes the instantiation of an OutEdges implementation might have large 
> overhead, e.g. if it access the configuration.  Instead of creating it 
> directly, introduce a factory class that can be instantiated once. 



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Resolved] (GIRAPH-1153) Update json dependency version

2018-03-23 Thread Maja Kabiljo (JIRA)

 [ 
https://issues.apache.org/jira/browse/GIRAPH-1153?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Maja Kabiljo resolved GIRAPH-1153.
--
Resolution: Fixed
  Assignee: Dionysios Logothetis

https://github.com/apache/giraph/pull/43

> Update json dependency version
> --
>
> Key: GIRAPH-1153
> URL: https://issues.apache.org/jira/browse/GIRAPH-1153
> Project: Giraph
>  Issue Type: Improvement
>Reporter: Dionysios Logothetis
>Assignee: Dionysios Logothetis
>Priority: Minor
>
> This is a pretty old json version, conflicts with newer ones. Updating to a 
> more recent.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Resolved] (GIRAPH-1175) Avoid evaluation of Partition.getEdgeCount in log line

2018-03-23 Thread Maja Kabiljo (JIRA)

 [ 
https://issues.apache.org/jira/browse/GIRAPH-1175?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Maja Kabiljo resolved GIRAPH-1175.
--
Resolution: Fixed
  Assignee: Dionysios Logothetis

> Avoid evaluation of Partition.getEdgeCount in log line
> --
>
> Key: GIRAPH-1175
> URL: https://issues.apache.org/jira/browse/GIRAPH-1175
> Project: Giraph
>  Issue Type: Bug
>Reporter: Dionysios Logothetis
>Assignee: Dionysios Logothetis
>Priority: Major
>
> Calling {{partition.getEdgeCount()}} iteration on all the vertices of the 
> partition, which can be expensive. The expression inside {{checkNotNull}} is 
> always evaluated, making this expensive. This constructs the string only if 
> necessary.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Resolved] (GIRAPH-1149) Fix initialization of IdAndValueArrayEdges and IdAndNullArrayEdges

2018-03-23 Thread Maja Kabiljo (JIRA)

 [ 
https://issues.apache.org/jira/browse/GIRAPH-1149?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Maja Kabiljo resolved GIRAPH-1149.
--
Resolution: Fixed

> Fix initialization of IdAndValueArrayEdges and IdAndNullArrayEdges
> --
>
> Key: GIRAPH-1149
> URL: https://issues.apache.org/jira/browse/GIRAPH-1149
> Project: Giraph
>  Issue Type: Bug
>Reporter: Dionysios Logothetis
>Assignee: Dionysios Logothetis
>Priority: Major
>
> The initialize() method for these implementations does not reset the 
> underlying data structure (array) just like in other implementations (e.g. 
> HashMapEdges). This introduces bugs when the OutEdges implementation is 
> re-used during input.
> https://github.com/apache/giraph/pull/40



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Resolved] (GIRAPH-1167) Add Long2ByteHashMapEdges implementation

2018-03-23 Thread Maja Kabiljo (JIRA)

 [ 
https://issues.apache.org/jira/browse/GIRAPH-1167?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Maja Kabiljo resolved GIRAPH-1167.
--
Resolution: Fixed
  Assignee: Dionysios Logothetis

> Add Long2ByteHashMapEdges implementation
> 
>
> Key: GIRAPH-1167
> URL: https://issues.apache.org/jira/browse/GIRAPH-1167
> Project: Giraph
>  Issue Type: New Feature
>Reporter: Dionysios Logothetis
>Assignee: Dionysios Logothetis
>Priority: Minor
>
> Memory efficient OutEdges implementation to hold long IDs and byte values. 
> This is similar to the existing Long2DoubleHashMapEdges implementation.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Resolved] (GIRAPH-1154) Improve message printed for super-vertices

2018-03-23 Thread Maja Kabiljo (JIRA)

 [ 
https://issues.apache.org/jira/browse/GIRAPH-1154?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Maja Kabiljo resolved GIRAPH-1154.
--
Resolution: Fixed
  Assignee: Dionysios Logothetis

> Improve message printed for super-vertices
> --
>
> Key: GIRAPH-1154
> URL: https://issues.apache.org/jira/browse/GIRAPH-1154
> Project: Giraph
>  Issue Type: Improvement
>Reporter: Dionysios Logothetis
>Assignee: Dionysios Logothetis
>Priority: Minor
>
> When a job fails due to super-vertices the message printed does not explain 
> to the users how to set the giraph.useBigDataIOForMessages option. 



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Resolved] (GIRAPH-1164) Set mapred.job.tracker/mapred.local.dir options in InternalVertexRunner

2018-03-23 Thread Maja Kabiljo (JIRA)

 [ 
https://issues.apache.org/jira/browse/GIRAPH-1164?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Maja Kabiljo resolved GIRAPH-1164.
--
Resolution: Fixed
  Assignee: Dionysios Logothetis

> Set mapred.job.tracker/mapred.local.dir options in InternalVertexRunner
> ---
>
> Key: GIRAPH-1164
> URL: https://issues.apache.org/jira/browse/GIRAPH-1164
> Project: Giraph
>  Issue Type: Improvement
>Reporter: Dionysios Logothetis
>Assignee: Dionysios Logothetis
>Priority: Trivial
>
> The mapred.job.tracker and mapred.local.dir are always expected when the 
> InternalVertexRunner is used but they are not set explicitly. Instead, they 
> are expected to be loaded from an external options file (or passed as 
> parameters from the tests).  Setting them explicitly while stile allowing 
> them to be overriden makes more sense.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Created] (GIRAPH-1182) Log hostname which we disconnected from

2018-03-22 Thread Maja Kabiljo (JIRA)
Maja Kabiljo created GIRAPH-1182:


 Summary: Log hostname which we disconnected from
 Key: GIRAPH-1182
 URL: https://issues.apache.org/jira/browse/GIRAPH-1182
 Project: Giraph
  Issue Type: Improvement
Reporter: Maja Kabiljo
Assignee: Maja Kabiljo


When we can't make a connection, log hostname which was causing the problem



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Created] (GIRAPH-1174) Support having GcObservers

2018-03-09 Thread Maja Kabiljo (JIRA)
Maja Kabiljo created GIRAPH-1174:


 Summary: Support having GcObservers
 Key: GIRAPH-1174
 URL: https://issues.apache.org/jira/browse/GIRAPH-1174
 Project: Giraph
  Issue Type: New Feature
Reporter: Maja Kabiljo
Assignee: Maja Kabiljo






--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Created] (GIRAPH-1171) Collect stats about how long it took to process each partition

2018-02-16 Thread Maja Kabiljo (JIRA)
Maja Kabiljo created GIRAPH-1171:


 Summary: Collect stats about how long it took to process each 
partition
 Key: GIRAPH-1171
 URL: https://issues.apache.org/jira/browse/GIRAPH-1171
 Project: Giraph
  Issue Type: Improvement
Reporter: Maja Kabiljo
Assignee: Maja Kabiljo


In order to make it easier to analyze whether there are some vertices in the 
graph which slow down the computation, or processing times of partitions is 
imbalanced, expose the stats about how long it took for each partition to be 
processed.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Created] (GIRAPH-1169) Expose jobGotAllMappers callback in DefaultJobProgressTracker

2018-02-12 Thread Maja Kabiljo (JIRA)
Maja Kabiljo created GIRAPH-1169:


 Summary: Expose jobGotAllMappers callback in 
DefaultJobProgressTracker
 Key: GIRAPH-1169
 URL: https://issues.apache.org/jira/browse/GIRAPH-1169
 Project: Giraph
  Issue Type: Improvement
Reporter: Maja Kabiljo
Assignee: Maja Kabiljo






--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Created] (GIRAPH-1166) Allow MasterObserver to get superstep aggregated metrics

2018-01-10 Thread Maja Kabiljo (JIRA)
Maja Kabiljo created GIRAPH-1166:


 Summary: Allow MasterObserver to get superstep aggregated metrics
 Key: GIRAPH-1166
 URL: https://issues.apache.org/jira/browse/GIRAPH-1166
 Project: Giraph
  Issue Type: Improvement
Reporter: Maja Kabiljo
Assignee: Maja Kabiljo
Priority: Minor


Pass superstep AggregatedMetrics to MasterObserver, to be able to analyze eg 
stragglers in jobs.



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)


[jira] [Created] (GIRAPH-1165) Skip iterating through vertices in supersteps with just global logic

2017-11-07 Thread Maja Kabiljo (JIRA)
Maja Kabiljo created GIRAPH-1165:


 Summary: Skip iterating through vertices in supersteps with just 
global logic
 Key: GIRAPH-1165
 URL: https://issues.apache.org/jira/browse/GIRAPH-1165
 Project: Giraph
  Issue Type: Improvement
Reporter: Maja Kabiljo
Assignee: Maja Kabiljo
Priority: Minor


Some supersteps don't do anything with vertices but just do global worker or 
master computation or perform aggregation. Not iterating through vertices in 
these cases can save time (some time is still spent in zookeeper barrier but 
that can be addressed separately).



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)


[jira] [Created] (GIRAPH-1159) DefaultJobProgressTrackerService: Only kill the job if it's still running

2017-09-15 Thread Maja Kabiljo (JIRA)
Maja Kabiljo created GIRAPH-1159:


 Summary: DefaultJobProgressTrackerService: Only kill the job if 
it's still running
 Key: GIRAPH-1159
 URL: https://issues.apache.org/jira/browse/GIRAPH-1159
 Project: Giraph
  Issue Type: Bug
Reporter: Maja Kabiljo
Assignee: Maja Kabiljo


In killJobWithMessage, we need to check if the job has completed before killing 
it.



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)


[jira] [Created] (GIRAPH-1157) Allow implementations of JobProgressTrackerService to extend kill job behavior

2017-09-07 Thread Maja Kabiljo (JIRA)
Maja Kabiljo created GIRAPH-1157:


 Summary: Allow implementations of JobProgressTrackerService to 
extend kill job behavior
 Key: GIRAPH-1157
 URL: https://issues.apache.org/jira/browse/GIRAPH-1157
 Project: Giraph
  Issue Type: Improvement
Reporter: Maja Kabiljo
Assignee: Maja Kabiljo
Priority: Minor






--
This message was sent by Atlassian JIRA
(v6.4.14#64029)


[jira] [Created] (GIRAPH-1148) Connected components - make calculate sizes work with large number of components

2017-05-30 Thread Maja Kabiljo (JIRA)
Maja Kabiljo created GIRAPH-1148:


 Summary: Connected components - make calculate sizes work with 
large number of components
 Key: GIRAPH-1148
 URL: https://issues.apache.org/jira/browse/GIRAPH-1148
 Project: Giraph
  Issue Type: Improvement
Reporter: Maja Kabiljo
Assignee: Maja Kabiljo


Currently if we have a graph with large number of connected components, 
calculating connected components sizes fails because reducer becomes too large. 
Use array of handles instead.



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)


[jira] [Resolved] (GIRAPH-1147) Store timestamps when various fractions of input were done

2017-05-30 Thread Maja Kabiljo (JIRA)

 [ 
https://issues.apache.org/jira/browse/GIRAPH-1147?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Maja Kabiljo resolved GIRAPH-1147.
--
Resolution: Fixed

> Store timestamps when various fractions of input were done
> --
>
> Key: GIRAPH-1147
> URL: https://issues.apache.org/jira/browse/GIRAPH-1147
> Project: Giraph
>  Issue Type: New Feature
>Reporter: Maja Kabiljo
>Assignee: Maja Kabiljo
>Priority: Minor
>
> In order to evaluate how read stragglers affect job performance, add a way to 
> expose timestamps when various fractions of input were done reading through 
> counters.



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)


[jira] [Resolved] (GIRAPH-1138) Don't wrap exceptions from executor service

2017-05-18 Thread Maja Kabiljo (JIRA)

 [ 
https://issues.apache.org/jira/browse/GIRAPH-1138?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Maja Kabiljo resolved GIRAPH-1138.
--
Resolution: Fixed

> Don't wrap exceptions from executor service
> ---
>
> Key: GIRAPH-1138
> URL: https://issues.apache.org/jira/browse/GIRAPH-1138
> Project: Giraph
>  Issue Type: Improvement
>Reporter: Maja Kabiljo
>Assignee: Maja Kabiljo
>Priority: Minor
>
> In ProgressableUtils.getResultsWithNCallables we wrap exceptions from 
> underlying threads, making logs hard to read. We should re-throw original 
> exception when possible.



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)


[jira] [Resolved] (GIRAPH-1146) Keep track of number of supersteps when possible

2017-05-18 Thread Maja Kabiljo (JIRA)

 [ 
https://issues.apache.org/jira/browse/GIRAPH-1146?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Maja Kabiljo resolved GIRAPH-1146.
--
Resolution: Fixed

> Keep track of number of supersteps when possible
> 
>
> Key: GIRAPH-1146
> URL: https://issues.apache.org/jira/browse/GIRAPH-1146
> Project: Giraph
>  Issue Type: New Feature
>Reporter: Maja Kabiljo
>Assignee: Maja Kabiljo
>Priority: Minor
>
> In many cases we know how many supersteps are there going to be. We can keep 
> track of it and log it with progress.



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)


[jira] [Created] (GIRAPH-1147) Store timestamps when various fractions of input were done

2017-05-18 Thread Maja Kabiljo (JIRA)
Maja Kabiljo created GIRAPH-1147:


 Summary: Store timestamps when various fractions of input were done
 Key: GIRAPH-1147
 URL: https://issues.apache.org/jira/browse/GIRAPH-1147
 Project: Giraph
  Issue Type: New Feature
Reporter: Maja Kabiljo
Assignee: Maja Kabiljo
Priority: Minor


In order to evaluate how read stragglers affect job performance, add a way to 
expose timestamps when various fractions of input were done reading through 
counters.



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)


[jira] [Created] (GIRAPH-1146) Keep track of number of supersteps when possible

2017-05-04 Thread Maja Kabiljo (JIRA)
Maja Kabiljo created GIRAPH-1146:


 Summary: Keep track of number of supersteps when possible
 Key: GIRAPH-1146
 URL: https://issues.apache.org/jira/browse/GIRAPH-1146
 Project: Giraph
  Issue Type: New Feature
Reporter: Maja Kabiljo
Assignee: Maja Kabiljo
Priority: Minor


In many cases we know how many supersteps are there going to be. We can keep 
track of it and log it with progress.



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)


[jira] [Resolved] (GIRAPH-1133) Fix JobProgressTracker in OverrideExceptionHandler

2017-05-04 Thread Maja Kabiljo (JIRA)

 [ 
https://issues.apache.org/jira/browse/GIRAPH-1133?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Maja Kabiljo resolved GIRAPH-1133.
--
Resolution: Fixed

> Fix JobProgressTracker in OverrideExceptionHandler
> --
>
> Key: GIRAPH-1133
> URL: https://issues.apache.org/jira/browse/GIRAPH-1133
> Project: Giraph
>  Issue Type: Bug
>Reporter: Maja Kabiljo
>Assignee: Maja Kabiljo
>Priority: Minor
>
> We create OverrideExceptionHandler before JobProgressTracker, so it can't 
> report errors to command line.



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)


[jira] [Resolved] (GIRAPH-1140) Cleanup temp files in hdfs after job is done

2017-05-04 Thread Maja Kabiljo (JIRA)

 [ 
https://issues.apache.org/jira/browse/GIRAPH-1140?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Maja Kabiljo resolved GIRAPH-1140.
--
Resolution: Fixed

> Cleanup temp files in hdfs after job is done
> 
>
> Key: GIRAPH-1140
> URL: https://issues.apache.org/jira/browse/GIRAPH-1140
> Project: Giraph
>  Issue Type: Bug
>Reporter: Maja Kabiljo
>Assignee: Maja Kabiljo
>
> Currently we are not cleaning up temp files we create in hdfs, fix it.



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)


[jira] [Resolved] (GIRAPH-1134) Track number of input splits in command line

2017-05-04 Thread Maja Kabiljo (JIRA)

 [ 
https://issues.apache.org/jira/browse/GIRAPH-1134?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Maja Kabiljo resolved GIRAPH-1134.
--
Resolution: Fixed

> Track number of input splits in command line
> 
>
> Key: GIRAPH-1134
> URL: https://issues.apache.org/jira/browse/GIRAPH-1134
> Project: Giraph
>  Issue Type: Improvement
>Reporter: Maja Kabiljo
>Assignee: Maja Kabiljo
>Priority: Minor
>
> The progress we track during input reports how much data have we read, but 
> not how much data there is to read.



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)


[jira] [Resolved] (GIRAPH-1141) Kill the job if no progress is being made

2017-05-04 Thread Maja Kabiljo (JIRA)

 [ 
https://issues.apache.org/jira/browse/GIRAPH-1141?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Maja Kabiljo resolved GIRAPH-1141.
--
Resolution: Fixed

> Kill the job if no progress is being made
> -
>
> Key: GIRAPH-1141
> URL: https://issues.apache.org/jira/browse/GIRAPH-1141
> Project: Giraph
>  Issue Type: New Feature
>Reporter: Maja Kabiljo
>Assignee: Maja Kabiljo
>Priority: Minor
>
> Sometimes jobs can get stuck for various reasons, it's better to have an 
> option to kill them then to keep them running holding resources.



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)


[jira] [Created] (GIRAPH-1141) Kill the job if no progress is being made

2017-03-31 Thread Maja Kabiljo (JIRA)
Maja Kabiljo created GIRAPH-1141:


 Summary: Kill the job if no progress is being made
 Key: GIRAPH-1141
 URL: https://issues.apache.org/jira/browse/GIRAPH-1141
 Project: Giraph
  Issue Type: New Feature
Reporter: Maja Kabiljo
Assignee: Maja Kabiljo
Priority: Minor


Sometimes jobs can get stuck for various reasons, it's better to have an option 
to kill them then to keep them running holding resources.



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)


[jira] [Created] (GIRAPH-1140) Cleanup temp files in hdfs after job is done

2017-03-30 Thread Maja Kabiljo (JIRA)
Maja Kabiljo created GIRAPH-1140:


 Summary: Cleanup temp files in hdfs after job is done
 Key: GIRAPH-1140
 URL: https://issues.apache.org/jira/browse/GIRAPH-1140
 Project: Giraph
  Issue Type: Bug
Reporter: Maja Kabiljo
Assignee: Maja Kabiljo


Currently we are not cleaning up temp files we create in hdfs, fix it.



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)


[jira] [Created] (GIRAPH-1138) Don't wrap exceptions from executor service

2017-03-22 Thread Maja Kabiljo (JIRA)
Maja Kabiljo created GIRAPH-1138:


 Summary: Don't wrap exceptions from executor service
 Key: GIRAPH-1138
 URL: https://issues.apache.org/jira/browse/GIRAPH-1138
 Project: Giraph
  Issue Type: Improvement
Reporter: Maja Kabiljo
Assignee: Maja Kabiljo
Priority: Minor


In ProgressableUtils.getResultsWithNCallables we wrap exceptions from 
underlying threads, making logs hard to read. We should re-throw original 
exception when possible.



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)


[jira] [Created] (GIRAPH-1134) Track number of input splits in command line

2017-03-07 Thread Maja Kabiljo (JIRA)
Maja Kabiljo created GIRAPH-1134:


 Summary: Track number of input splits in command line
 Key: GIRAPH-1134
 URL: https://issues.apache.org/jira/browse/GIRAPH-1134
 Project: Giraph
  Issue Type: Improvement
Reporter: Maja Kabiljo
Assignee: Maja Kabiljo
Priority: Minor


The progress we track during input reports how much data have we read, but not 
how much data there is to read.



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)


[jira] [Commented] (GIRAPH-1133) Fix JobProgressTracker in OverrideExceptionHandler

2017-03-03 Thread Maja Kabiljo (JIRA)

[ 
https://issues.apache.org/jira/browse/GIRAPH-1133?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15894936#comment-15894936
 ] 

Maja Kabiljo commented on GIRAPH-1133:
--

https://github.com/apache/giraph/pull/22

> Fix JobProgressTracker in OverrideExceptionHandler
> --
>
> Key: GIRAPH-1133
> URL: https://issues.apache.org/jira/browse/GIRAPH-1133
> Project: Giraph
>  Issue Type: Bug
>Reporter: Maja Kabiljo
>Assignee: Maja Kabiljo
>Priority: Minor
>
> We create OverrideExceptionHandler before JobProgressTracker, so it can't 
> report errors to command line.



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)


[jira] [Created] (GIRAPH-1133) Fix JobProgressTracker in OverrideExceptionHandler

2017-03-03 Thread Maja Kabiljo (JIRA)
Maja Kabiljo created GIRAPH-1133:


 Summary: Fix JobProgressTracker in OverrideExceptionHandler
 Key: GIRAPH-1133
 URL: https://issues.apache.org/jira/browse/GIRAPH-1133
 Project: Giraph
  Issue Type: Bug
Reporter: Maja Kabiljo
Assignee: Maja Kabiljo
Priority: Minor


We create OverrideExceptionHandler before JobProgressTracker, so it can't 
report errors to command line.



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)


[jira] [Resolved] (GIRAPH-1115) Move UncaughtExceptionHandler setup to GraphTaskManager

2016-09-19 Thread Maja Kabiljo (JIRA)

 [ 
https://issues.apache.org/jira/browse/GIRAPH-1115?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Maja Kabiljo resolved GIRAPH-1115.
--
Resolution: Fixed

> Move UncaughtExceptionHandler setup to GraphTaskManager
> ---
>
> Key: GIRAPH-1115
> URL: https://issues.apache.org/jira/browse/GIRAPH-1115
> Project: Giraph
>  Issue Type: Bug
>Reporter: Maja Kabiljo
>Assignee: Maja Kabiljo
>Priority: Minor
>




--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Resolved] (GIRAPH-1114) Expose StatusReporter from workers in blocks framework

2016-09-19 Thread Maja Kabiljo (JIRA)

 [ 
https://issues.apache.org/jira/browse/GIRAPH-1114?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Maja Kabiljo resolved GIRAPH-1114.
--
Resolution: Fixed

> Expose StatusReporter from workers in blocks framework
> --
>
> Key: GIRAPH-1114
> URL: https://issues.apache.org/jira/browse/GIRAPH-1114
> Project: Giraph
>  Issue Type: New Feature
>Reporter: Maja Kabiljo
>Assignee: Maja Kabiljo
>Priority: Minor
>
> Sometimes we need to call progress or update status from workers, expose this 
> functionality



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Resolved] (GIRAPH-1108) Allow measuring time spent doing GC in some interval

2016-09-19 Thread Maja Kabiljo (JIRA)

 [ 
https://issues.apache.org/jira/browse/GIRAPH-1108?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Maja Kabiljo resolved GIRAPH-1108.
--
Resolution: Fixed

> Allow measuring time spent doing GC in some interval
> 
>
> Key: GIRAPH-1108
> URL: https://issues.apache.org/jira/browse/GIRAPH-1108
> Project: Giraph
>  Issue Type: New Feature
>Reporter: Maja Kabiljo
>Assignee: Maja Kabiljo
>Priority: Minor
>
> Sometimes when things are slow, we want to know whether it's because of GC or 
> not. Keep track of last k GC pauses and a way to check how much time since 
> some timestamp was spent doing GC.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (GIRAPH-1115) Move UncaughtExceptionHandler setup to GraphTaskManager

2016-09-19 Thread Maja Kabiljo (JIRA)

[ 
https://issues.apache.org/jira/browse/GIRAPH-1115?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15504408#comment-15504408
 ] 

Maja Kabiljo commented on GIRAPH-1115:
--

https://reviews.facebook.net/D64113

> Move UncaughtExceptionHandler setup to GraphTaskManager
> ---
>
> Key: GIRAPH-1115
> URL: https://issues.apache.org/jira/browse/GIRAPH-1115
> Project: Giraph
>  Issue Type: Bug
>Reporter: Maja Kabiljo
>Assignee: Maja Kabiljo
>Priority: Minor
>




--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Created] (GIRAPH-1115) Move UncaughtExceptionHandler setup to GraphTaskManager

2016-09-19 Thread Maja Kabiljo (JIRA)
Maja Kabiljo created GIRAPH-1115:


 Summary: Move UncaughtExceptionHandler setup to GraphTaskManager
 Key: GIRAPH-1115
 URL: https://issues.apache.org/jira/browse/GIRAPH-1115
 Project: Giraph
  Issue Type: Bug
Reporter: Maja Kabiljo
Assignee: Maja Kabiljo
Priority: Minor






--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (GIRAPH-1114) Expose StatusReporter from workers in blocks framework

2016-09-14 Thread Maja Kabiljo (JIRA)

[ 
https://issues.apache.org/jira/browse/GIRAPH-1114?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15491764#comment-15491764
 ] 

Maja Kabiljo commented on GIRAPH-1114:
--

https://reviews.facebook.net/D63999

> Expose StatusReporter from workers in blocks framework
> --
>
> Key: GIRAPH-1114
> URL: https://issues.apache.org/jira/browse/GIRAPH-1114
> Project: Giraph
>  Issue Type: New Feature
>Reporter: Maja Kabiljo
>Assignee: Maja Kabiljo
>Priority: Minor
>
> Sometimes we need to call progress or update status from workers, expose this 
> functionality



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Created] (GIRAPH-1114) Expose StatusReporter from workers in blocks framework

2016-09-14 Thread Maja Kabiljo (JIRA)
Maja Kabiljo created GIRAPH-1114:


 Summary: Expose StatusReporter from workers in blocks framework
 Key: GIRAPH-1114
 URL: https://issues.apache.org/jira/browse/GIRAPH-1114
 Project: Giraph
  Issue Type: New Feature
Reporter: Maja Kabiljo
Assignee: Maja Kabiljo
Priority: Minor


Sometimes we need to call progress or update status from workers, expose this 
functionality



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (GIRAPH-1108) Allow measuring time spent doing GC in some interval

2016-08-26 Thread Maja Kabiljo (JIRA)

[ 
https://issues.apache.org/jira/browse/GIRAPH-1108?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15439821#comment-15439821
 ] 

Maja Kabiljo commented on GIRAPH-1108:
--

https://reviews.facebook.net/D62727

> Allow measuring time spent doing GC in some interval
> 
>
> Key: GIRAPH-1108
> URL: https://issues.apache.org/jira/browse/GIRAPH-1108
> Project: Giraph
>  Issue Type: New Feature
>Reporter: Maja Kabiljo
>Assignee: Maja Kabiljo
>Priority: Minor
>
> Sometimes when things are slow, we want to know whether it's because of GC or 
> not. Keep track of last k GC pauses and a way to check how much time since 
> some timestamp was spent doing GC.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Created] (GIRAPH-1108) Allow measuring time spent doing GC in some interval

2016-08-26 Thread Maja Kabiljo (JIRA)
Maja Kabiljo created GIRAPH-1108:


 Summary: Allow measuring time spent doing GC in some interval
 Key: GIRAPH-1108
 URL: https://issues.apache.org/jira/browse/GIRAPH-1108
 Project: Giraph
  Issue Type: New Feature
Reporter: Maja Kabiljo
Assignee: Maja Kabiljo
Priority: Minor


Sometimes when things are slow, we want to know whether it's because of GC or 
not. Keep track of last k GC pauses and a way to check how much time since some 
timestamp was spent doing GC.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Resolved] (GIRAPH-1103) Another try to fix jobs getting stuck after channel failure

2016-08-26 Thread Maja Kabiljo (JIRA)

 [ 
https://issues.apache.org/jira/browse/GIRAPH-1103?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Maja Kabiljo resolved GIRAPH-1103.
--
Resolution: Fixed

> Another try to fix jobs getting stuck after channel failure
> ---
>
> Key: GIRAPH-1103
> URL: https://issues.apache.org/jira/browse/GIRAPH-1103
> Project: Giraph
>  Issue Type: Bug
>Reporter: Maja Kabiljo
>Assignee: Maja Kabiljo
>
> With GIRAPH-1087 we see jobs stuck after channel failure less often, but it 
> still happens. There are several additional issues I found: requests failing 
> to send at the first place so they never get retried, callbacks for channel 
> failures not being triggered always.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Resolved] (GIRAPH-1105) Fix number of open requests in FacebookConfiguration

2016-08-23 Thread Maja Kabiljo (JIRA)

 [ 
https://issues.apache.org/jira/browse/GIRAPH-1105?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Maja Kabiljo resolved GIRAPH-1105.
--
Resolution: Fixed

> Fix number of open requests in FacebookConfiguration
> 
>
> Key: GIRAPH-1105
> URL: https://issues.apache.org/jira/browse/GIRAPH-1105
> Project: Giraph
>  Issue Type: Improvement
>Reporter: Maja Kabiljo
>Assignee: Maja Kabiljo
>




--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Resolved] (GIRAPH-1107) Allow observers to access job counters

2016-08-23 Thread Maja Kabiljo (JIRA)

 [ 
https://issues.apache.org/jira/browse/GIRAPH-1107?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Maja Kabiljo resolved GIRAPH-1107.
--
Resolution: Fixed

> Allow observers to access job counters
> --
>
> Key: GIRAPH-1107
> URL: https://issues.apache.org/jira/browse/GIRAPH-1107
> Project: Giraph
>  Issue Type: New Feature
>Reporter: Maja Kabiljo
>Assignee: Maja Kabiljo
>Priority: Minor
>
> From mapper/master/worker observer we might want to update some job counters 
> for stats. For that we should allow observers to access job context.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (GIRAPH-1107) Allow observers to access job counters

2016-08-23 Thread Maja Kabiljo (JIRA)

[ 
https://issues.apache.org/jira/browse/GIRAPH-1107?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15433349#comment-15433349
 ] 

Maja Kabiljo commented on GIRAPH-1107:
--

https://reviews.facebook.net/D62391

> Allow observers to access job counters
> --
>
> Key: GIRAPH-1107
> URL: https://issues.apache.org/jira/browse/GIRAPH-1107
> Project: Giraph
>  Issue Type: New Feature
>Reporter: Maja Kabiljo
>Assignee: Maja Kabiljo
>Priority: Minor
>
> From mapper/master/worker observer we might want to update some job counters 
> for stats. For that we should allow observers to access job context.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Created] (GIRAPH-1107) Allow observers to access job counters

2016-08-23 Thread Maja Kabiljo (JIRA)
Maja Kabiljo created GIRAPH-1107:


 Summary: Allow observers to access job counters
 Key: GIRAPH-1107
 URL: https://issues.apache.org/jira/browse/GIRAPH-1107
 Project: Giraph
  Issue Type: New Feature
Reporter: Maja Kabiljo
Assignee: Maja Kabiljo
Priority: Minor


>From mapper/master/worker observer we might want to update some job counters 
>for stats. For that we should allow observers to access job context.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (GIRAPH-1105) Fix number of open requests in FacebookConfiguration

2016-08-12 Thread Maja Kabiljo (JIRA)

[ 
https://issues.apache.org/jira/browse/GIRAPH-1105?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15419569#comment-15419569
 ] 

Maja Kabiljo commented on GIRAPH-1105:
--

https://reviews.facebook.net/D62019

> Fix number of open requests in FacebookConfiguration
> 
>
> Key: GIRAPH-1105
> URL: https://issues.apache.org/jira/browse/GIRAPH-1105
> Project: Giraph
>  Issue Type: Improvement
>Reporter: Maja Kabiljo
>Assignee: Maja Kabiljo
>




--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Created] (GIRAPH-1105) Fix number of open requests in FacebookConfiguration

2016-08-12 Thread Maja Kabiljo (JIRA)
Maja Kabiljo created GIRAPH-1105:


 Summary: Fix number of open requests in FacebookConfiguration
 Key: GIRAPH-1105
 URL: https://issues.apache.org/jira/browse/GIRAPH-1105
 Project: Giraph
  Issue Type: Improvement
Reporter: Maja Kabiljo
Assignee: Maja Kabiljo






--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (GIRAPH-1104) NegativeArraySize exception in BigDataOutput

2016-08-10 Thread Maja Kabiljo (JIRA)

[ 
https://issues.apache.org/jira/browse/GIRAPH-1104?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15415912#comment-15415912
 ] 

Maja Kabiljo commented on GIRAPH-1104:
--

This didn't seem to solve the problem, https://reviews.facebook.net/D61839 does 
while following max byte[] size semantics from BigDataIO.

> NegativeArraySize exception in BigDataOutput
> 
>
> Key: GIRAPH-1104
> URL: https://issues.apache.org/jira/browse/GIRAPH-1104
> Project: Giraph
>  Issue Type: Bug
>Reporter: Sergey Edunov
>Assignee: Sergey Edunov
>
> We're seeing this exception in some jobs. Supposedly related to high degree 
> vertices
> Caused by: java.lang.NegativeArraySizeException
>   at 
> org.apache.giraph.utils.UnsafeByteArrayOutputStream.ensureSize(UnsafeByteArrayOutputStream.java:117)
>   at 
> org.apache.giraph.utils.UnsafeByteArrayOutputStream.write(UnsafeByteArrayOutputStream.java:168)
>   at 
> org.apache.giraph.utils.io.BigDataOutput.write(BigDataOutput.java:183)
>   at org.apache.giraph.edge.ByteArrayEdges.write(ByteArrayEdges.java:204)
>   at 
> org.apache.giraph.ooc.data.DiskBackedPartitionStore.writeOutEdges(DiskBackedPartitionStore.java:353)
>   at 
> org.apache.giraph.ooc.data.DiskBackedPartitionStore.offloadInMemoryPartitionData(DiskBackedPartitionStore.java:389)
>   at 
> org.apache.giraph.ooc.data.DiskBackedDataStore.offloadPartitionDataProxy(DiskBackedDataStore.java:294)
>   at 
> org.apache.giraph.ooc.data.DiskBackedPartitionStore.offloadPartitionData(DiskBackedPartitionStore.java:318)
>   at 
> org.apache.giraph.ooc.command.StorePartitionIOCommand.execute(StorePartitionIOCommand.java:55)
>   at 
> org.apache.giraph.ooc.OutOfCoreIOCallable.call(OutOfCoreIOCallable.java:99)



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Created] (GIRAPH-1103) Another try to fix jobs getting stuck after channel failure

2016-08-08 Thread Maja Kabiljo (JIRA)
Maja Kabiljo created GIRAPH-1103:


 Summary: Another try to fix jobs getting stuck after channel 
failure
 Key: GIRAPH-1103
 URL: https://issues.apache.org/jira/browse/GIRAPH-1103
 Project: Giraph
  Issue Type: Bug
Reporter: Maja Kabiljo
Assignee: Maja Kabiljo


With GIRAPH-1087 we see jobs stuck after channel failure less often, but it 
still happens. There are several additional issues I found: requests failing to 
send at the first place so they never get retried, callbacks for channel 
failures not being triggered always.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Resolved] (GIRAPH-1087) Retry requests after channel failure

2016-07-19 Thread Maja Kabiljo (JIRA)

 [ 
https://issues.apache.org/jira/browse/GIRAPH-1087?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Maja Kabiljo resolved GIRAPH-1087.
--
Resolution: Fixed

> Retry requests after channel failure
> 
>
> Key: GIRAPH-1087
> URL: https://issues.apache.org/jira/browse/GIRAPH-1087
> Project: Giraph
>  Issue Type: Bug
>Reporter: Maja Kabiljo
>Assignee: Maja Kabiljo
>
> We currently don't have a callback to retry requests after channel failure, 
> and would either wait for request timeout or not retrying request at all at 
> places where we don't wait for open requests.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Resolved] (GIRAPH-1086) Use pool of byte arrays with InMemoryDataAccessor

2016-07-19 Thread Maja Kabiljo (JIRA)

 [ 
https://issues.apache.org/jira/browse/GIRAPH-1086?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Maja Kabiljo resolved GIRAPH-1086.
--
Resolution: Fixed

> Use pool of byte arrays with InMemoryDataAccessor
> -
>
> Key: GIRAPH-1086
> URL: https://issues.apache.org/jira/browse/GIRAPH-1086
> Project: Giraph
>  Issue Type: Improvement
>Reporter: Maja Kabiljo
>Assignee: Maja Kabiljo
>
> Have a pool of byte arrays with InMemoryDataAccessor, to save on byte array 
> creation and initialization.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (GIRAPH-1087) Retry requests after channel failure

2016-07-19 Thread Maja Kabiljo (JIRA)

[ 
https://issues.apache.org/jira/browse/GIRAPH-1087?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15384399#comment-15384399
 ] 

Maja Kabiljo commented on GIRAPH-1087:
--

https://reviews.facebook.net/D60675

> Retry requests after channel failure
> 
>
> Key: GIRAPH-1087
> URL: https://issues.apache.org/jira/browse/GIRAPH-1087
> Project: Giraph
>  Issue Type: Bug
>Reporter: Maja Kabiljo
>Assignee: Maja Kabiljo
>
> We currently don't have a callback to retry requests after channel failure, 
> and would either wait for request timeout or not retrying request at all at 
> places where we don't wait for open requests.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Resolved] (GIRAPH-1091) Fix SimpleRangePartitionFactoryTest

2016-07-19 Thread Maja Kabiljo (JIRA)

 [ 
https://issues.apache.org/jira/browse/GIRAPH-1091?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Maja Kabiljo resolved GIRAPH-1091.
--
Resolution: Fixed

> Fix SimpleRangePartitionFactoryTest
> ---
>
> Key: GIRAPH-1091
> URL: https://issues.apache.org/jira/browse/GIRAPH-1091
> Project: Giraph
>  Issue Type: Bug
>Reporter: Maja Kabiljo
>Assignee: Maja Kabiljo
>Priority: Minor
> Fix For: 1.2.0
>
>
> SimpleRangePartitionFactoryTest relied on old logic for calculating number of 
> partitions and got broken with GIRAPH-1082.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (GIRAPH-1091) Fix SimpleRangePartitionFactoryTest

2016-07-13 Thread Maja Kabiljo (JIRA)

[ 
https://issues.apache.org/jira/browse/GIRAPH-1091?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15375470#comment-15375470
 ] 

Maja Kabiljo commented on GIRAPH-1091:
--

https://reviews.facebook.net/D60747

> Fix SimpleRangePartitionFactoryTest
> ---
>
> Key: GIRAPH-1091
> URL: https://issues.apache.org/jira/browse/GIRAPH-1091
> Project: Giraph
>  Issue Type: Bug
>Reporter: Maja Kabiljo
>Assignee: Maja Kabiljo
>Priority: Minor
>
> SimpleRangePartitionFactoryTest relied on old logic for calculating number of 
> partitions and got broken with GIRAPH-1082.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Created] (GIRAPH-1091) Fix SimpleRangePartitionFactoryTest

2016-07-13 Thread Maja Kabiljo (JIRA)
Maja Kabiljo created GIRAPH-1091:


 Summary: Fix SimpleRangePartitionFactoryTest
 Key: GIRAPH-1091
 URL: https://issues.apache.org/jira/browse/GIRAPH-1091
 Project: Giraph
  Issue Type: Bug
Reporter: Maja Kabiljo
Assignee: Maja Kabiljo
Priority: Minor


SimpleRangePartitionFactoryTest relied on old logic for calculating number of 
partitions and got broken with GIRAPH-1082.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Created] (GIRAPH-1090) Allow getting shards of broadcasts in ShardedBroadcastHandle

2016-07-12 Thread Maja Kabiljo (JIRA)
Maja Kabiljo created GIRAPH-1090:


 Summary: Allow getting shards of broadcasts in 
ShardedBroadcastHandle
 Key: GIRAPH-1090
 URL: https://issues.apache.org/jira/browse/GIRAPH-1090
 Project: Giraph
  Issue Type: Improvement
Reporter: Maja Kabiljo
Assignee: Maja Kabiljo
Priority: Minor


When the value we are reducing / broadcasting in shards is large, sometimes 
it's more efficient to get the shards separately and process them instead of 
getting the globally reduced one. Expose that functionality.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Created] (GIRAPH-1087) Retry requests after channel failure

2016-07-12 Thread Maja Kabiljo (JIRA)
Maja Kabiljo created GIRAPH-1087:


 Summary: Retry requests after channel failure
 Key: GIRAPH-1087
 URL: https://issues.apache.org/jira/browse/GIRAPH-1087
 Project: Giraph
  Issue Type: Bug
Reporter: Maja Kabiljo
Assignee: Maja Kabiljo


We currently don't have a callback to retry requests after channel failure, and 
would either wait for request timeout or not retrying request at all at places 
where we don't wait for open requests.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (GIRAPH-1086) Use pool of byte arrays with InMemoryDataAccessor

2016-07-11 Thread Maja Kabiljo (JIRA)

[ 
https://issues.apache.org/jira/browse/GIRAPH-1086?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15371888#comment-15371888
 ] 

Maja Kabiljo commented on GIRAPH-1086:
--

https://reviews.facebook.net/D60621

> Use pool of byte arrays with InMemoryDataAccessor
> -
>
> Key: GIRAPH-1086
> URL: https://issues.apache.org/jira/browse/GIRAPH-1086
> Project: Giraph
>  Issue Type: Improvement
>Reporter: Maja Kabiljo
>Assignee: Maja Kabiljo
>
> Have a pool of byte arrays with InMemoryDataAccessor, to save on byte array 
> creation and initialization.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Created] (GIRAPH-1086) Use pool of byte arrays with InMemoryDataAccessor

2016-07-11 Thread Maja Kabiljo (JIRA)
Maja Kabiljo created GIRAPH-1086:


 Summary: Use pool of byte arrays with InMemoryDataAccessor
 Key: GIRAPH-1086
 URL: https://issues.apache.org/jira/browse/GIRAPH-1086
 Project: Giraph
  Issue Type: Improvement
Reporter: Maja Kabiljo
Assignee: Maja Kabiljo


Have a pool of byte arrays with InMemoryDataAccessor, to save on byte array 
creation and initialization.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Resolved] (GIRAPH-1085) Add InMemoryDataAccessor

2016-07-11 Thread Maja Kabiljo (JIRA)

 [ 
https://issues.apache.org/jira/browse/GIRAPH-1085?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Maja Kabiljo resolved GIRAPH-1085.
--
Resolution: Fixed

> Add InMemoryDataAccessor
> 
>
> Key: GIRAPH-1085
> URL: https://issues.apache.org/jira/browse/GIRAPH-1085
> Project: Giraph
>  Issue Type: New Feature
>Reporter: Maja Kabiljo
>Assignee: Maja Kabiljo
>
> When we deal with graphs which have a lot of vertices with very little total 
> data associated with them (values + edges) we start experiencing memory 
> problems because of too many objects created, since every vertex has multiple 
> objects associated with it. To solve this problem, we should have a 
> serialized partition representation (current ByteArrayPartition just keeps 
> byte[] per vertex, not per partition). We can leverage the out-of-core 
> infrastructure and just add data accessor which won't be backed by disk but 
> in memory buffers.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Resolved] (GIRAPH-1083) Make sure we fail after exception in ooc-io thread happens

2016-07-11 Thread Maja Kabiljo (JIRA)

 [ 
https://issues.apache.org/jira/browse/GIRAPH-1083?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Maja Kabiljo resolved GIRAPH-1083.
--
Resolution: Fixed

> Make sure we fail after exception in ooc-io thread happens
> --
>
> Key: GIRAPH-1083
> URL: https://issues.apache.org/jira/browse/GIRAPH-1083
> Project: Giraph
>  Issue Type: Bug
>Reporter: Maja Kabiljo
>Assignee: Maja Kabiljo
>
> Currently if some exception happens in ooc-io thread the job is left running 
> for long time after the exception. We should make sure we fail early.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (GIRAPH-1085) Add InMemoryDataAccessor

2016-07-06 Thread Maja Kabiljo (JIRA)

[ 
https://issues.apache.org/jira/browse/GIRAPH-1085?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15365188#comment-15365188
 ] 

Maja Kabiljo commented on GIRAPH-1085:
--

https://reviews.facebook.net/D60435

> Add InMemoryDataAccessor
> 
>
> Key: GIRAPH-1085
> URL: https://issues.apache.org/jira/browse/GIRAPH-1085
> Project: Giraph
>  Issue Type: New Feature
>Reporter: Maja Kabiljo
>Assignee: Maja Kabiljo
>
> When we deal with graphs which have a lot of vertices with very little total 
> data associated with them (values + edges) we start experiencing memory 
> problems because of too many objects created, since every vertex has multiple 
> objects associated with it. To solve this problem, we should have a 
> serialized partition representation (current ByteArrayPartition just keeps 
> byte[] per vertex, not per partition). We can leverage the out-of-core 
> infrastructure and just add data accessor which won't be backed by disk but 
> in memory buffers.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Created] (GIRAPH-1085) Add InMemoryDataAccessor

2016-07-06 Thread Maja Kabiljo (JIRA)
Maja Kabiljo created GIRAPH-1085:


 Summary: Add InMemoryDataAccessor
 Key: GIRAPH-1085
 URL: https://issues.apache.org/jira/browse/GIRAPH-1085
 Project: Giraph
  Issue Type: New Feature
Reporter: Maja Kabiljo
Assignee: Maja Kabiljo


When we deal with graphs which have a lot of vertices with very little total 
data associated with them (values + edges) we start experiencing memory 
problems because of too many objects created, since every vertex has multiple 
objects associated with it. To solve this problem, we should have a serialized 
partition representation (current ByteArrayPartition just keeps byte[] per 
vertex, not per partition). We can leverage the out-of-core infrastructure and 
just add data accessor which won't be backed by disk but in memory buffers.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (GIRAPH-1083) Make sure we fail after exception in ooc-io thread happens

2016-07-01 Thread Maja Kabiljo (JIRA)

[ 
https://issues.apache.org/jira/browse/GIRAPH-1083?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15359579#comment-15359579
 ] 

Maja Kabiljo commented on GIRAPH-1083:
--

https://reviews.facebook.net/D60291

> Make sure we fail after exception in ooc-io thread happens
> --
>
> Key: GIRAPH-1083
> URL: https://issues.apache.org/jira/browse/GIRAPH-1083
> Project: Giraph
>  Issue Type: Bug
>Reporter: Maja Kabiljo
>Assignee: Maja Kabiljo
>
> Currently if some exception happens in ooc-io thread the job is left running 
> for long time after the exception. We should make sure we fail early.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Created] (GIRAPH-1083) Make sure we fail after exception in ooc-io thread happens

2016-07-01 Thread Maja Kabiljo (JIRA)
Maja Kabiljo created GIRAPH-1083:


 Summary: Make sure we fail after exception in ooc-io thread happens
 Key: GIRAPH-1083
 URL: https://issues.apache.org/jira/browse/GIRAPH-1083
 Project: Giraph
  Issue Type: Bug
Reporter: Maja Kabiljo
Assignee: Maja Kabiljo


Currently if some exception happens in ooc-io thread the job is left running 
for long time after the exception. We should make sure we fail early.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (GIRAPH-1082) Remove limit on the number of partitions

2016-07-01 Thread Maja Kabiljo (JIRA)

[ 
https://issues.apache.org/jira/browse/GIRAPH-1082?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15359063#comment-15359063
 ] 

Maja Kabiljo commented on GIRAPH-1082:
--

https://reviews.facebook.net/D60267

> Remove limit on the number of partitions
> 
>
> Key: GIRAPH-1082
> URL: https://issues.apache.org/jira/browse/GIRAPH-1082
> Project: Giraph
>  Issue Type: Improvement
>Reporter: Maja Kabiljo
>Assignee: Maja Kabiljo
>
> Currently we have a limit on how many partitions we can have because we write 
> all partition information to Zookeeper. We can instead send this information 
> in requests and remove the hard limit.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Created] (GIRAPH-1082) Remove limit on the number of partitions

2016-07-01 Thread Maja Kabiljo (JIRA)
Maja Kabiljo created GIRAPH-1082:


 Summary: Remove limit on the number of partitions
 Key: GIRAPH-1082
 URL: https://issues.apache.org/jira/browse/GIRAPH-1082
 Project: Giraph
  Issue Type: Improvement
Reporter: Maja Kabiljo
Assignee: Maja Kabiljo


Currently we have a limit on how many partitions we can have because we write 
all partition information to Zookeeper. We can instead send this information in 
requests and remove the hard limit.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (GIRAPH-1063) Make primitive type generated fixed capacity min heaps

2016-05-17 Thread Maja Kabiljo (JIRA)

[ 
https://issues.apache.org/jira/browse/GIRAPH-1063?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15286660#comment-15286660
 ] 

Maja Kabiljo commented on GIRAPH-1063:
--

https://reviews.facebook.net/D58299

> Make primitive type generated fixed capacity min heaps
> --
>
> Key: GIRAPH-1063
> URL: https://issues.apache.org/jira/browse/GIRAPH-1063
> Project: Giraph
>  Issue Type: New Feature
>Reporter: Maja Kabiljo
>Assignee: Maja Kabiljo
>
> It's often needed to get top k (key, value) pairs, but existing 
> implementations deal with objects making them inefficient. Make one with 
> primitive types.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Created] (GIRAPH-1063) Make primitive type generated fixed capacity min heaps

2016-05-17 Thread Maja Kabiljo (JIRA)
Maja Kabiljo created GIRAPH-1063:


 Summary: Make primitive type generated fixed capacity min heaps
 Key: GIRAPH-1063
 URL: https://issues.apache.org/jira/browse/GIRAPH-1063
 Project: Giraph
  Issue Type: New Feature
Reporter: Maja Kabiljo
Assignee: Maja Kabiljo


It's often needed to get top k (key, value) pairs, but existing implementations 
deal with objects making them inefficient. Make one with primitive types.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Created] (GIRAPH-1062) Page rank in Blocks

2016-05-11 Thread Maja Kabiljo (JIRA)
Maja Kabiljo created GIRAPH-1062:


 Summary: Page rank in Blocks
 Key: GIRAPH-1062
 URL: https://issues.apache.org/jira/browse/GIRAPH-1062
 Project: Giraph
  Issue Type: New Feature
Reporter: Maja Kabiljo
Assignee: Maja Kabiljo


We have some examples of pagerank, but they all have some things missing. Make 
one which will take sinks into account, have convergence checks, support both 
weighted and unweighted graphs.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (GIRAPH-1061) Add Connected Components block factory

2016-05-09 Thread Maja Kabiljo (JIRA)

[ 
https://issues.apache.org/jira/browse/GIRAPH-1061?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15277349#comment-15277349
 ] 

Maja Kabiljo commented on GIRAPH-1061:
--

https://reviews.facebook.net/D57951

> Add Connected Components block factory
> --
>
> Key: GIRAPH-1061
> URL: https://issues.apache.org/jira/browse/GIRAPH-1061
> Project: Giraph
>  Issue Type: New Feature
>Reporter: Maja Kabiljo
>Assignee: Maja Kabiljo
>
> Add block factory for Connected Components to make it easy to run it.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Created] (GIRAPH-1061) Add Connected Components block factory

2016-05-09 Thread Maja Kabiljo (JIRA)
Maja Kabiljo created GIRAPH-1061:


 Summary: Add Connected Components block factory
 Key: GIRAPH-1061
 URL: https://issues.apache.org/jira/browse/GIRAPH-1061
 Project: Giraph
  Issue Type: New Feature
Reporter: Maja Kabiljo
Assignee: Maja Kabiljo


Add block factory for Connected Components to make it easy to run it.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Resolved] (GIRAPH-1060) Add combiner to connected components

2016-05-09 Thread Maja Kabiljo (JIRA)

 [ 
https://issues.apache.org/jira/browse/GIRAPH-1060?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Maja Kabiljo resolved GIRAPH-1060.
--
Resolution: Fixed

> Add combiner to connected components
> 
>
> Key: GIRAPH-1060
> URL: https://issues.apache.org/jira/browse/GIRAPH-1060
> Project: Giraph
>  Issue Type: Improvement
>Reporter: Maja Kabiljo
>Assignee: Maja Kabiljo
>
> Connected components should use combiner to make it more efficient and 
> require less memory.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Created] (GIRAPH-1060) Add combiner to connected components

2016-05-09 Thread Maja Kabiljo (JIRA)
Maja Kabiljo created GIRAPH-1060:


 Summary: Add combiner to connected components
 Key: GIRAPH-1060
 URL: https://issues.apache.org/jira/browse/GIRAPH-1060
 Project: Giraph
  Issue Type: Improvement
Reporter: Maja Kabiljo
Assignee: Maja Kabiljo


Connected components should use combiner to make it more efficient and require 
less memory.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (GIRAPH-1059) Clean up MessageEncodeAndStoreTypes

2016-05-06 Thread Maja Kabiljo (JIRA)

[ 
https://issues.apache.org/jira/browse/GIRAPH-1059?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15274519#comment-15274519
 ] 

Maja Kabiljo commented on GIRAPH-1059:
--

https://reviews.facebook.net/D57765

> Clean up MessageEncodeAndStoreTypes
> ---
>
> Key: GIRAPH-1059
> URL: https://issues.apache.org/jira/browse/GIRAPH-1059
> Project: Giraph
>  Issue Type: Improvement
>Reporter: Maja Kabiljo
>Assignee: Maja Kabiljo
>
> Currently names of constants in MessageEncodeAndStoreTypes are confusing and 
> it's hard to know what should we set to get some behavior. Improve the naming 
> and documentation, and put all the logic for choosing the type at the same 
> place.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Created] (GIRAPH-1059) Clean up MessageEncodeAndStoreTypes

2016-05-06 Thread Maja Kabiljo (JIRA)
Maja Kabiljo created GIRAPH-1059:


 Summary: Clean up MessageEncodeAndStoreTypes
 Key: GIRAPH-1059
 URL: https://issues.apache.org/jira/browse/GIRAPH-1059
 Project: Giraph
  Issue Type: Improvement
Reporter: Maja Kabiljo
Assignee: Maja Kabiljo


Currently names of constants in MessageEncodeAndStoreTypes are confusing and 
it's hard to know what should we set to get some behavior. Improve the naming 
and documentation, and put all the logic for choosing the type at the same 
place.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (GIRAPH-1036) Allow mappers to fail early on exceptions

2016-05-06 Thread Maja Kabiljo (JIRA)

[ 
https://issues.apache.org/jira/browse/GIRAPH-1036?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15274495#comment-15274495
 ] 

Maja Kabiljo commented on GIRAPH-1036:
--

https://reviews.facebook.net/D49143

> Allow mappers to fail early on exceptions
> -
>
> Key: GIRAPH-1036
> URL: https://issues.apache.org/jira/browse/GIRAPH-1036
> Project: Giraph
>  Issue Type: Bug
>Reporter: Maja Kabiljo
>Assignee: Maja Kabiljo
>
> Often when something fails in a mapper we see it stuck until its timeout 
> passes. Digging through this issue I found two root causes:
> - Many threads we are creating were not daemon, preventing process to exit, 
> only main thread should be daemon
> - When calling submit on ExecutorService, exceptions are not propagated back 
> to the caller, unless get is called on the future. In 
> ProgressableUtils.getResultsWithNCallables we were calling get on one by one 
> future, causing us to have to wait for previous futures to finish before 
> getting exception which happened in later one.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Resolved] (GIRAPH-1036) Allow mappers to fail early on exceptions

2016-05-06 Thread Maja Kabiljo (JIRA)

 [ 
https://issues.apache.org/jira/browse/GIRAPH-1036?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Maja Kabiljo resolved GIRAPH-1036.
--
Resolution: Fixed

> Allow mappers to fail early on exceptions
> -
>
> Key: GIRAPH-1036
> URL: https://issues.apache.org/jira/browse/GIRAPH-1036
> Project: Giraph
>  Issue Type: Bug
>Reporter: Maja Kabiljo
>Assignee: Maja Kabiljo
>
> Often when something fails in a mapper we see it stuck until its timeout 
> passes. Digging through this issue I found two root causes:
> - Many threads we are creating were not daemon, preventing process to exit, 
> only main thread should be daemon
> - When calling submit on ExecutorService, exceptions are not propagated back 
> to the caller, unless get is called on the future. In 
> ProgressableUtils.getResultsWithNCallables we were calling get on one by one 
> future, causing us to have to wait for previous futures to finish before 
> getting exception which happened in later one.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Resolved] (GIRAPH-1053) Log exceptions to command line

2016-05-06 Thread Maja Kabiljo (JIRA)

 [ 
https://issues.apache.org/jira/browse/GIRAPH-1053?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Maja Kabiljo resolved GIRAPH-1053.
--
Resolution: Fixed

> Log exceptions to command line
> --
>
> Key: GIRAPH-1053
> URL: https://issues.apache.org/jira/browse/GIRAPH-1053
> Project: Giraph
>  Issue Type: New Feature
>Reporter: Maja Kabiljo
>Assignee: Maja Kabiljo
>Priority: Minor
>
> When we know an exception occurred, log it to command line to make it easier 
> for people running jobs to see what the issue was.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Resolved] (GIRAPH-1039) Fix stopping jmap histo thread

2016-05-06 Thread Maja Kabiljo (JIRA)

 [ 
https://issues.apache.org/jira/browse/GIRAPH-1039?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Maja Kabiljo resolved GIRAPH-1039.
--
Resolution: Fixed

> Fix stopping jmap histo thread
> --
>
> Key: GIRAPH-1039
> URL: https://issues.apache.org/jira/browse/GIRAPH-1039
> Project: Giraph
>  Issue Type: Bug
>Reporter: Maja Kabiljo
>Assignee: Maja Kabiljo
>Priority: Minor
>
> Currently if jmap histo frequency is set to long period we end up stuck in 
> the end of the job for a long time waiting on jmap histo thread



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Resolved] (GIRAPH-1046) Add a way to synchronize full GC calls across workers

2016-05-06 Thread Maja Kabiljo (JIRA)

 [ 
https://issues.apache.org/jira/browse/GIRAPH-1046?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Maja Kabiljo resolved GIRAPH-1046.
--
Resolution: Fixed

> Add a way to synchronize full GC calls across workers
> -
>
> Key: GIRAPH-1046
> URL: https://issues.apache.org/jira/browse/GIRAPH-1046
> Project: Giraph
>  Issue Type: New Feature
>Reporter: Maja Kabiljo
>Assignee: Maja Kabiljo
>
> In applications which use memory more heavily, we can see full GC pauses 
> happening on different workers at different times, and each of these is 
> causing some delay because other workers are often waiting on something from 
> the worker in GC (closing open requests, finishing superstep, etc). Having a 
> way to coordinate when full GCs are called could help them have less effect 
> on job performance.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Resolved] (GIRAPH-1058) Fix connection retry logic

2016-05-06 Thread Maja Kabiljo (JIRA)

 [ 
https://issues.apache.org/jira/browse/GIRAPH-1058?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Maja Kabiljo resolved GIRAPH-1058.
--
Resolution: Fixed

> Fix connection retry logic
> --
>
> Key: GIRAPH-1058
> URL: https://issues.apache.org/jira/browse/GIRAPH-1058
> Project: Giraph
>  Issue Type: Bug
>Reporter: Maja Kabiljo
>Assignee: Maja Kabiljo
>
> Currently when we fail to connect to a channel we retry immediately and that 
> retry most often fails. Add a short wait between retries, and improve the 
> check for whether the channel connected successfully.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (GIRAPH-1058) Fix connection retry logic

2016-04-29 Thread Maja Kabiljo (JIRA)

[ 
https://issues.apache.org/jira/browse/GIRAPH-1058?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15264678#comment-15264678
 ] 

Maja Kabiljo commented on GIRAPH-1058:
--

https://reviews.facebook.net/D57447

> Fix connection retry logic
> --
>
> Key: GIRAPH-1058
> URL: https://issues.apache.org/jira/browse/GIRAPH-1058
> Project: Giraph
>  Issue Type: Bug
>Reporter: Maja Kabiljo
>Assignee: Maja Kabiljo
>
> Currently when we fail to connect to a channel we retry immediately and that 
> retry most often fails. Add a short wait between retries, and improve the 
> check for whether the channel connected successfully.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Created] (GIRAPH-1058) Fix connection retry logic

2016-04-29 Thread Maja Kabiljo (JIRA)
Maja Kabiljo created GIRAPH-1058:


 Summary: Fix connection retry logic
 Key: GIRAPH-1058
 URL: https://issues.apache.org/jira/browse/GIRAPH-1058
 Project: Giraph
  Issue Type: Bug
Reporter: Maja Kabiljo
Assignee: Maja Kabiljo


Currently when we fail to connect to a channel we retry immediately and that 
retry most often fails. Add a short wait between retries, and improve the check 
for whether the channel connected successfully.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (GIRAPH-1053) Log exceptions to command line

2016-04-18 Thread Maja Kabiljo (JIRA)

[ 
https://issues.apache.org/jira/browse/GIRAPH-1053?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15246845#comment-15246845
 ] 

Maja Kabiljo commented on GIRAPH-1053:
--

https://reviews.facebook.net/D56931

> Log exceptions to command line
> --
>
> Key: GIRAPH-1053
> URL: https://issues.apache.org/jira/browse/GIRAPH-1053
> Project: Giraph
>  Issue Type: New Feature
>Reporter: Maja Kabiljo
>Assignee: Maja Kabiljo
>Priority: Minor
>
> When we know an exception occurred, log it to command line to make it easier 
> for people running jobs to see what the issue was.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Created] (GIRAPH-1053) Log exceptions to command line

2016-04-18 Thread Maja Kabiljo (JIRA)
Maja Kabiljo created GIRAPH-1053:


 Summary: Log exceptions to command line
 Key: GIRAPH-1053
 URL: https://issues.apache.org/jira/browse/GIRAPH-1053
 Project: Giraph
  Issue Type: New Feature
Reporter: Maja Kabiljo
Assignee: Maja Kabiljo
Priority: Minor


When we know an exception occurred, log it to command line to make it easier 
for people running jobs to see what the issue was.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Resolved] (GIRAPH-1052) Fix makeSymmetricUnweighted

2016-04-08 Thread Maja Kabiljo (JIRA)

 [ 
https://issues.apache.org/jira/browse/GIRAPH-1052?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Maja Kabiljo resolved GIRAPH-1052.
--
Resolution: Fixed

> Fix makeSymmetricUnweighted
> ---
>
> Key: GIRAPH-1052
> URL: https://issues.apache.org/jira/browse/GIRAPH-1052
> Project: Giraph
>  Issue Type: Bug
>Reporter: Maja Kabiljo
>Assignee: Maja Kabiljo
>
> PrepareGraphPieces.makeSymmetricUnweighted is currently very inefficient for 
> skewed degree graphs, because it reuses set objects based on the number of in 
> edges, but also adds all out edges to the set, so sets which should be small 
> can become huge.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Resolved] (GIRAPH-1051) Allow changing class loader in Kryo

2016-04-08 Thread Maja Kabiljo (JIRA)

 [ 
https://issues.apache.org/jira/browse/GIRAPH-1051?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Maja Kabiljo resolved GIRAPH-1051.
--
Resolution: Not A Problem

> Allow changing class loader in Kryo
> ---
>
> Key: GIRAPH-1051
> URL: https://issues.apache.org/jira/browse/GIRAPH-1051
> Project: Giraph
>  Issue Type: New Feature
>Reporter: Maja Kabiljo
>Assignee: Maja Kabiljo
>Priority: Minor
>
> Add a way to specify our own Kryo class loader.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Resolved] (GIRAPH-1050) Add MapperObserver

2016-04-08 Thread Maja Kabiljo (JIRA)

 [ 
https://issues.apache.org/jira/browse/GIRAPH-1050?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Maja Kabiljo resolved GIRAPH-1050.
--
Resolution: Fixed

> Add MapperObserver
> --
>
> Key: GIRAPH-1050
> URL: https://issues.apache.org/jira/browse/GIRAPH-1050
> Project: Giraph
>  Issue Type: New Feature
>Reporter: Maja Kabiljo
>Assignee: Maja Kabiljo
>Priority: Minor
>
> Add MapperObserver which will be called once per mapper before anything else 
> happens.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Created] (GIRAPH-1051) Allow changing class loader in Kryo

2016-04-07 Thread Maja Kabiljo (JIRA)
Maja Kabiljo created GIRAPH-1051:


 Summary: Allow changing class loader in Kryo
 Key: GIRAPH-1051
 URL: https://issues.apache.org/jira/browse/GIRAPH-1051
 Project: Giraph
  Issue Type: New Feature
Reporter: Maja Kabiljo
Assignee: Maja Kabiljo
Priority: Minor


Add a way to specify our own Kryo class loader.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


  1   2   3   4   5   6   7   >