Thanks Wei-Chiu. Just join both hdfs and yarn channel. Yes, there is a yarn
channel. There are only 3 members in the yarn channel.
Best,
Yufei
`This is not a contribution`
On Fri, Oct 11, 2019 at 4:35 PM Wei-Chiu Chuang wrote:
> Hi Hadoop devs,
>
> In case you don't know, there is an
+1 for this idea. Thanks Wangda for bringing this up.
Some comments to share:
- Agenda needed to be posted ahead of meeting and welcome any interested
party to contribute to topics.
- We should encourage more people to attend. That's whole point of the
meeting.
- Hopefully, this
[
https://issues.apache.org/jira/browse/YARN-8406?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Yufei Gu resolved YARN-8406.
Resolution: Duplicate
> Do the improvement to the FSLeafQueue about calculating fair share for a
Yufei Gu created YARN-8185:
--
Summary: Improve log in class DirectoryCollection
Key: YARN-8185
URL: https://issues.apache.org/jira/browse/YARN-8185
Project: Hadoop YARN
Issue Type: Improvement
Yufei Gu created YARN-8184:
--
Summary: Too many metrics if containerLocalizer uses
ReadWriteDiskValidator
Key: YARN-8184
URL: https://issues.apache.org/jira/browse/YARN-8184
Project: Hadoop YARN
Yufei Gu created YARN-8162:
--
Summary: Method DirectoryCollection#verifyDirUsingMkdir isn't
needed anymore
Key: YARN-8162
URL: https://issues.apache.org/jira/browse/YARN-8162
Project: Hadoop YARN
Yufei Gu created YARN-8158:
--
Summary: Document that create tag doesn't work for rule
secondaryGroupExistingQueue
Key: YARN-8158
URL: https://issues.apache.org/jira/browse/YARN-8158
Project: Hadoop YARN
[
https://issues.apache.org/jira/browse/YARN-7968?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Yufei Gu resolved YARN-7968.
Resolution: Won't Fix
> Reset the queue name in submission context while recovering an applicat
Thanks Lei for working on this!
+1 (non-binding)
- Downloaded the binary tarball and verified the checksum.
- Started a pseudo cluster inside one docker container
- Run Resource Manager with Fair Scheduler
- Verified distributed shell
- Verified mapreduce pi job
- Sanity
Thanks Wangda for working on this!
+1 (non-binding)
- Downloaded the binary tarball and verified the checksum.
- Started a pseudo cluster inside one docker container
- Run Resource Manager with Fair Scheduler
- Verified distributed shell
- Verified mapreduce pi job
- Sanity
Yufei Gu created YARN-8061:
--
Summary: An application may preempt itself in case of minshare
preemption
Key: YARN-8061
URL: https://issues.apache.org/jira/browse/YARN-8061
Project: Hadoop YARN
Yufei Gu created YARN-8059:
--
Summary: Resource type is ignored when FS decide to preempt
Key: YARN-8059
URL: https://issues.apache.org/jira/browse/YARN-8059
Project: Hadoop YARN
Issue Type: Bug
Thanks Eddy!
+1 (non-binding)
- Downloaded the hadoop-3.0.1.tar.gz from
http://home.apache.org/~lei/hadoop-3.0.1-RC1/
- Started a pseudo cluster inside one docker container
- Verified distributed shell
- Verified mapreduce pi job
- Sanity check RM WebUI
Best,
Yufei
On Tue, Mar
Yufei Gu created YARN-8024:
--
Summary: LOG in class MaxRunningAppsEnforcer is initialized with a
faulty class FairScheduler
Key: YARN-8024
URL: https://issues.apache.org/jira/browse/YARN-8024
Project
Yufei Gu created YARN-7968:
--
Summary: Reset the queue name in submission context while
recovering an application
Key: YARN-7968
URL: https://issues.apache.org/jira/browse/YARN-7968
Project: Hadoop YARN
Yufei Gu created YARN-7967:
--
Summary: Better doc and Java doc for Fair Scheduler Queue ACL
Key: YARN-7967
URL: https://issues.apache.org/jira/browse/YARN-7967
Project: Hadoop YARN
Issue Type
Yufei Gu created YARN-7966:
--
Summary: Remove AllocationConfiguration#getQueueAcl and related
unit test
Key: YARN-7966
URL: https://issues.apache.org/jira/browse/YARN-7966
Project: Hadoop YARN
Yufei Gu created YARN-7948:
--
Summary: Enable refreshing maximum allocation for multiple
resource types
Key: YARN-7948
URL: https://issues.apache.org/jira/browse/YARN-7948
Project: Hadoop YARN
Yufei Gu created YARN-7903:
--
Summary: Method getStarvedResourceRequests() only consider the
first encountered resource
Key: YARN-7903
URL: https://issues.apache.org/jira/browse/YARN-7903
Project: Hadoop
Yufei Gu created YARN-7853:
--
Summary: SLS failed to startup due to
java.lang.NoClassDefFoundError
Key: YARN-7853
URL: https://issues.apache.org/jira/browse/YARN-7853
Project: Hadoop YARN
Issue
Yufei Gu created YARN-7705:
--
Summary: Create the container log directory with correct sticky
bit in C code
Key: YARN-7705
URL: https://issues.apache.org/jira/browse/YARN-7705
Project: Hadoop YARN
+1 for this idea. It will provide better isolation for scheduler simulating
to split NM simulators or/and AM simulators to different daemons, which can
be placed to different hosts.
On Wed, Dec 20, 2017 at 2:18 AM Jasson Chenwei
wrote:
> It is an interesting point. So
Yeah, I found the same issue for YARN-7541 and several others. Don't know
how to fix this though.
Best,
Yufei
On Fri, Nov 24, 2017 at 9:18 AM, Sunil G wrote:
> Hello
>
> I am seeing continuous jenkins errors like below.
>
> Modes: MultiJDK Sentinel Jenkins Robot Docker
Yufei Gu created YARN-7449:
--
Summary: Split up class TestYarnClient to TestYarnClient and
TestYarnClientImpl
Key: YARN-7449
URL: https://issues.apache.org/jira/browse/YARN-7449
Project: Hadoop YARN
Yufei Gu created YARN-7413:
--
Summary: Support resource type in SLS
Key: YARN-7413
URL: https://issues.apache.org/jira/browse/YARN-7413
Project: Hadoop YARN
Issue Type: Improvement
Yufei Gu created YARN-7390:
--
Summary: All reservation related test cases failed when
TestYarnClient runs against Fair Scheduler.
Key: YARN-7390
URL: https://issues.apache.org/jira/browse/YARN-7390
Project
Yufei Gu created YARN-7363:
--
Summary: ContainerLocalizer don't have a valid log4j config in
case of Linux container executor
Key: YARN-7363
URL: https://issues.apache.org/jira/browse/YARN-7363
Project
[
https://issues.apache.org/jira/browse/YARN-4859?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Yufei Gu resolved YARN-4859.
Resolution: Works for Me
> [Bug] Unable to submit a job to a reservation when using FairSchedu
Yufei Gu created YARN-7348:
--
Summary: Ignore the vcore in reservation request for fair policy
queue
Key: YARN-7348
URL: https://issues.apache.org/jira/browse/YARN-7348
Project: Hadoop YARN
Issue
Yufei Gu created YARN-7342:
--
Summary: Application page doesn't show correct metrics for
reservation runs
Key: YARN-7342
URL: https://issues.apache.org/jira/browse/YARN-7342
Project: Hadoop YARN
Yufei Gu created YARN-7340:
--
Summary: Missing the time stamp in exception message in Class
NoOverCommitPolicy
Key: YARN-7340
URL: https://issues.apache.org/jira/browse/YARN-7340
Project: Hadoop YARN
Yufei Gu created YARN-7315:
--
Summary: Reconsider "Public" API in SchedulingPolicy of Fair
Scheduler
Key: YARN-7315
URL: https://issues.apache.org/jira/browse/YARN-7315
Project: Hadoop YARN
Yufei Gu created YARN-7311:
--
Summary: TestRMWebServicesReservation doesn't really test fair
scheduler
Key: YARN-7311
URL: https://issues.apache.org/jira/browse/YARN-7311
Project: Hadoop YARN
Issue
Yufei Gu created YARN-7306:
--
Summary: Set default RPC timeout to 5 minutes
Key: YARN-7306
URL: https://issues.apache.org/jira/browse/YARN-7306
Project: Hadoop YARN
Issue Type: Improvement
Yufei Gu created YARN-7291:
--
Summary: Better input parsing for resource in allocation file
Key: YARN-7291
URL: https://issues.apache.org/jira/browse/YARN-7291
Project: Hadoop YARN
Issue Type
[
https://issues.apache.org/jira/browse/YARN-7280?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Yufei Gu resolved YARN-7280.
Resolution: Not A Problem
> Rescan fair-scheduler.xml every n seco
Yufei Gu created YARN-7270:
--
Summary: Resource#getVirtualCores() does unsafe casting from long
to int.
Key: YARN-7270
URL: https://issues.apache.org/jira/browse/YARN-7270
Project: Hadoop YARN
Yufei Gu created YARN-7263:
--
Summary: Check host name resolution performance when resource
manager starts up
Key: YARN-7263
URL: https://issues.apache.org/jira/browse/YARN-7263
Project: Hadoop YARN
Yufei Gu created YARN-7261:
--
Summary: Add debug message in class FSDownload for better download
latency monitoring
Key: YARN-7261
URL: https://issues.apache.org/jira/browse/YARN-7261
Project: Hadoop YARN
Yufei Gu created YARN-7229:
--
Summary: Add the metric for size of event queue in AsyncDispatcher
Key: YARN-7229
URL: https://issues.apache.org/jira/browse/YARN-7229
Project: Hadoop YARN
Issue Type
Yufei Gu created YARN-7222:
--
Summary: Merge
org.apache.hadoop.yarn.server.resourcemanager.NodeManager with MockNM
Key: YARN-7222
URL: https://issues.apache.org/jira/browse/YARN-7222
Project: Hadoop YARN
Yufei Gu created YARN-7211:
--
Summary: Task in SLS does't work
Key: YARN-7211
URL: https://issues.apache.org/jira/browse/YARN-7211
Project: Hadoop YARN
Issue Type: Bug
Components
Yufei Gu created YARN-7207:
--
Summary: Cache the local host name when getting application list
in RM
Key: YARN-7207
URL: https://issues.apache.org/jira/browse/YARN-7207
Project: Hadoop YARN
Issue
It would be very helpful for testing the RC. To vote a RC, committers and
PMCs usually spend lots of time to compile, deploy the RC, do several
sanity tests, then +1 for the RC. The docker image potentially saves the
compilation and deployment time, and people can do more tests.
Best,
Yufei
On
Yufei Gu created YARN-7180:
--
Summary: Remove class ResourceType
Key: YARN-7180
URL: https://issues.apache.org/jira/browse/YARN-7180
Project: Hadoop YARN
Issue Type: Sub-task
Components
Yufei Gu created YARN-7045:
--
Summary: Remove FSLeafQueue#addAppSchedulable
Key: YARN-7045
URL: https://issues.apache.org/jira/browse/YARN-7045
Project: Hadoop YARN
Issue Type: Improvement
Yufei Gu created YARN-6971:
--
Summary: Clean up different ways to create resources
Key: YARN-6971
URL: https://issues.apache.org/jira/browse/YARN-6971
Project: Hadoop YARN
Issue Type: Improvement
Yufei Gu created YARN-6969:
--
Summary: Remove method getMinShareMemoryFraction and
getPendingContainers in class FairSchedulerQueueInfo
Key: YARN-6969
URL: https://issues.apache.org/jira/browse/YARN-6969
[
https://issues.apache.org/jira/browse/YARN-6926?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Yufei Gu resolved YARN-6926.
Resolution: Invalid
> FSSchedulerNode reservation confl
[
https://issues.apache.org/jira/browse/YARN-6944?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Yufei Gu resolved YARN-6944.
Resolution: Duplicate
> The comment about ResourceManager#createPolicyMonitors l
Yufei Gu created YARN-6954:
--
Summary: Remove interface PreemptableResourceScheduler
Key: YARN-6954
URL: https://issues.apache.org/jira/browse/YARN-6954
Project: Hadoop YARN
Issue Type: Improvement
Yufei Gu created YARN-6952:
--
Summary: Enable scheduling monitor in FS
Key: YARN-6952
URL: https://issues.apache.org/jira/browse/YARN-6952
Project: Hadoop YARN
Issue Type: Improvement
Set log level to DEBUG to enable it.
There are several ways to set the log level of a class. My favorite way is
to change it by visiting https://RM-address:port/logLevel.
Best,
Yufei
On Thu, Aug 3, 2017 at 4:38 PM, Jasson Chenwei
wrote:
> hi, all
>
> I found a lot of
Yufei Gu created YARN-6944:
--
Summary: The comment about ResourceManager#createPolicyMonitors
lies
Key: YARN-6944
URL: https://issues.apache.org/jira/browse/YARN-6944
Project: Hadoop YARN
Issue
Yufei Gu created YARN-6941:
--
Summary: Allow Queue placement policies to be ordered by attribute
Key: YARN-6941
URL: https://issues.apache.org/jira/browse/YARN-6941
Project: Hadoop YARN
Issue Type
[
https://issues.apache.org/jira/browse/YARN-6793?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Yufei Gu resolved YARN-6793.
Resolution: Duplicate
> Duplicated reservation in Fair Scheduler preempt
Yufei Gu created YARN-6845:
--
Summary: Variable scheduler in FSLeafQueue duplicate the one in
FSQueue
Key: YARN-6845
URL: https://issues.apache.org/jira/browse/YARN-6845
Project: Hadoop YARN
Issue
Yufei Gu created YARN-6823:
--
Summary: TestRMRestart#testRMRestartWaitForPreviousAMToFinish
consistently fails in PreCommit build
Key: YARN-6823
URL: https://issues.apache.org/jira/browse/YARN-6823
Project
Yufei Gu created YARN-6799:
--
Summary: Remove the duplicated code in CGroupsHandlerImp.java
Key: YARN-6799
URL: https://issues.apache.org/jira/browse/YARN-6799
Project: Hadoop YARN
Issue Type: Bug
Yufei Gu created YARN-6793:
--
Summary: Duplicated reservation in Fair Scheduler preemption
Key: YARN-6793
URL: https://issues.apache.org/jira/browse/YARN-6793
Project: Hadoop YARN
Issue Type: Bug
Yufei Gu created YARN-6764:
--
Summary: Simplify the logic in FairScheduler#attemptScheduling
Key: YARN-6764
URL: https://issues.apache.org/jira/browse/YARN-6764
Project: Hadoop YARN
Issue Type
Yufei Gu created YARN-6758:
--
Summary: Add elapsed time for SLS metrics
Key: YARN-6758
URL: https://issues.apache.org/jira/browse/YARN-6758
Project: Hadoop YARN
Issue Type: Sub-task
Yufei Gu created YARN-6729:
--
Summary: NM percentage-physical-cpu-limit should be always 100 if
DefaultLCEResourcesHandler is used
Key: YARN-6729
URL: https://issues.apache.org/jira/browse/YARN-6729
Project
Yufei Gu created YARN-6685:
--
Summary: Add job count in to SLS JSON input format
Key: YARN-6685
URL: https://issues.apache.org/jira/browse/YARN-6685
Project: Hadoop YARN
Issue Type: Sub-task
[
https://issues.apache.org/jira/browse/YARN-6644?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Yufei Gu resolved YARN-6644.
Resolution: Duplicate
> The demand of FSAppAttempt may be negat
Yufei Gu created YARN-6625:
--
Summary: yarn application -list returns a tracking URL for AM that
doesn't work in secured and HA environment
Key: YARN-6625
URL: https://issues.apache.org/jira/browse/YARN-6625
[
https://issues.apache.org/jira/browse/YARN-6581?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Yufei Gu resolved YARN-6581.
Resolution: Duplicate
> Function ContainersMonitorImpl.MonitoringThread#run() is too l
Yufei Gu created YARN-6581:
--
Summary: Function length of MonitoringThread#run() is too long
Key: YARN-6581
URL: https://issues.apache.org/jira/browse/YARN-6581
Project: Hadoop YARN
Issue Type
Yufei Gu created YARN-6580:
--
Summary: Incorrect LOG for FairSharePolicy
Key: YARN-6580
URL: https://issues.apache.org/jira/browse/YARN-6580
Project: Hadoop YARN
Issue Type: Bug
Components
Yufei Gu created YARN-6551:
--
Summary: Validate SLS input
Key: YARN-6551
URL: https://issues.apache.org/jira/browse/YARN-6551
Project: Hadoop YARN
Issue Type: Bug
Components: scheduler
Yufei Gu created YARN-6535:
--
Summary: Program need to exit when SLS finishes.
Key: YARN-6535
URL: https://issues.apache.org/jira/browse/YARN-6535
Project: Hadoop YARN
Issue Type: Bug
Yufei Gu created YARN-6522:
--
Summary: Container host should be optional in SLS JSON input file
format
Key: YARN-6522
URL: https://issues.apache.org/jira/browse/YARN-6522
Project: Hadoop YARN
Issue
Yufei Gu created YARN-6506:
--
Summary: Fix the code vulnerability of
org.apache.hadoop.yarn.sls.SLSRunner.simulateInfoMap
Key: YARN-6506
URL: https://issues.apache.org/jira/browse/YARN-6506
Project: Hadoop
Yufei Gu created YARN-6505:
--
Summary: Define the strings used in SLS JSON input file format
Key: YARN-6505
URL: https://issues.apache.org/jira/browse/YARN-6505
Project: Hadoop YARN
Issue Type: Sub
Yufei Gu created YARN-6499:
--
Summary: Remove the doc about Schedulable#redistributeShare()
Key: YARN-6499
URL: https://issues.apache.org/jira/browse/YARN-6499
Project: Hadoop YARN
Issue Type: Task
Yufei Gu created YARN-6498:
--
Summary: The SLS offline mode doesn't work
Key: YARN-6498
URL: https://issues.apache.org/jira/browse/YARN-6498
Project: Hadoop YARN
Issue Type: Bug
Components
Yufei Gu created YARN-6497:
--
Summary: Method length of ResourceManager#serviceInit() is too long
Key: YARN-6497
URL: https://issues.apache.org/jira/browse/YARN-6497
Project: Hadoop YARN
Issue Type
Yufei Gu created YARN-6481:
--
Summary: Yarn top shows negative container number in FS
Key: YARN-6481
URL: https://issues.apache.org/jira/browse/YARN-6481
Project: Hadoop YARN
Issue Type: Bug
[
https://issues.apache.org/jira/browse/YARN-6075?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Yufei Gu resolved YARN-6075.
Resolution: Won't Fix
> Yarn top for FairScheduler
> --
>
>
Yufei Gu created YARN-6468:
--
Summary: Add
Key: YARN-6468
URL: https://issues.apache.org/jira/browse/YARN-6468
Project: Hadoop YARN
Issue Type: Bug
Reporter: Yufei Gu
Yufei Gu created YARN-6448:
--
Summary: Add back the lock in continuous scheduling while sorting
nodes
Key: YARN-6448
URL: https://issues.apache.org/jira/browse/YARN-6448
Project: Hadoop YARN
Issue
Yufei Gu created YARN-6425:
--
Summary: Move out FS state dump code out of method update()
Key: YARN-6425
URL: https://issues.apache.org/jira/browse/YARN-6425
Project: Hadoop YARN
Issue Type: Bug
Yufei Gu created YARN-6423:
--
Summary: Queue metrics doesn't work for Fair Scheduler in SLS
Key: YARN-6423
URL: https://issues.apache.org/jira/browse/YARN-6423
Project: Hadoop YARN
Issue Type: Sub
Yufei Gu created YARN-6411:
--
Summary: Clean up the overwrite of createDispatcher() in subclass
of MockRM
Key: YARN-6411
URL: https://issues.apache.org/jira/browse/YARN-6411
Project: Hadoop YARN
Yufei Gu created YARN-6383:
--
Summary: Not be able to run SLS with FifoScheduler
Key: YARN-6383
URL: https://issues.apache.org/jira/browse/YARN-6383
Project: Hadoop YARN
Issue Type: New Feature
Yufei Gu created YARN-6372:
--
Summary: Add default value for NM disk validator
Key: YARN-6372
URL: https://issues.apache.org/jira/browse/YARN-6372
Project: Hadoop YARN
Issue Type: Sub-task
Thank Junping for working on this.
I verified the following:
1. Verified the md5 of binary tar ball.
2. Deployed on a 4 node cluster with 3 node managers.
3. Configured fair scheduler
4. Ran Pi job and verified the results.
5. Ran SLS for fair scheduler and capacity scheduler.
All good except
Yufei Gu created YARN-6360:
--
Summary: Prevent FS state dump logger from inheriting parents'
appenders.
Key: YARN-6360
URL: https://issues.apache.org/jira/browse/YARN-6360
Project: Hadoop YARN
[
https://issues.apache.org/jira/browse/YARN-4590?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Yufei Gu resolved YARN-4590.
Resolution: Duplicate
> SLS(Scheduler Load Simulator) web pages can't load css and js resou
Yufei Gu created YARN-6334:
--
Summary: TestRMFailover#testAutomaticFailover always passes even
RM didn't transition to Standby.
Key: YARN-6334
URL: https://issues.apache.org/jira/browse/YARN-6334
Project
Yufei Gu created YARN-6333:
--
Summary: Improve doc for minSharePreemptionTimeout,
fairSharePreemptionTimeout and fairSharePreemptionThreshold
Key: YARN-6333
URL: https://issues.apache.org/jira/browse/YARN-6333
Yufei Gu created YARN-6331:
--
Summary: Potential flakiness in TestFairScheduler#testDumpState
Key: YARN-6331
URL: https://issues.apache.org/jira/browse/YARN-6331
Project: Hadoop YARN
Issue Type: Bug
Yufei Gu created YARN-6326:
--
Summary: AppAttemptId is null while AM Simulator is trying to
track app in SLS
Key: YARN-6326
URL: https://issues.apache.org/jira/browse/YARN-6326
Project: Hadoop YARN
Yufei Gu created YARN-6324:
--
Summary: The log4j.properties in sample-conf doesn't work well for
SLS
Key: YARN-6324
URL: https://issues.apache.org/jira/browse/YARN-6324
Project: Hadoop YARN
Issue
Yufei Gu created YARN-6317:
--
Summary: Get rid of Resources#multiplyAndRoundDown since it
duplicates Resources#multiply
Key: YARN-6317
URL: https://issues.apache.org/jira/browse/YARN-6317
Project: Hadoop
Yufei Gu created YARN-6307:
--
Summary: Refactor FairShareComparator#compare
Key: YARN-6307
URL: https://issues.apache.org/jira/browse/YARN-6307
Project: Hadoop YARN
Issue Type: Bug
Yufei Gu created YARN-6275:
--
Summary: Fail to show real-time tracking charts in SLS
Key: YARN-6275
URL: https://issues.apache.org/jira/browse/YARN-6275
Project: Hadoop YARN
Issue Type: Bug
Yufei Gu created YARN-6222:
--
Summary: TestFairScheduler.testReservationMetrics is flaky
Key: YARN-6222
URL: https://issues.apache.org/jira/browse/YARN-6222
Project: Hadoop YARN
Issue Type: Bug
[
https://issues.apache.org/jira/browse/YARN-4691?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Yufei Gu resolved YARN-4691.
Resolution: Duplicate
> Cache resource usage at FSLeafQueue le
Yufei Gu created YARN-6204:
--
Summary: Set UncaughtExceptionHandler for event handling thread in
AsyncDispatcher
Key: YARN-6204
URL: https://issues.apache.org/jira/browse/YARN-6204
Project: Hadoop YARN
1 - 100 of 147 matches
Mail list logo