linux-toradex.git/kernel/workqueue.c, branch v4.6-rc2

Merge branch 'for-4.6' of git://git.kernel.org/pub/scm/linux/kernel/git/tj/wq

2016-03-19T03:05:39+00:00

Pull workqueue updates from Tejun Heo:
 "Three trivial workqueue changes"

* 'for-4.6' of git://git.kernel.org/pub/scm/linux/kernel/git/tj/wq:
  workqueue: Fix comment for work_on_cpu()
  sched/core: Get rid of 'cpu' argument in wq_worker_sleeping()
  workqueue: Replace usage of init_name with dev_set_name()

tags: Fix DEFINE_PER_CPU expansions

2016-03-15T23:55:16+00:00

$ make tags
  GEN     tags
ctags: Warning: drivers/acpi/processor_idle.c:64: null expansion of name pattern "\1"
ctags: Warning: drivers/xen/events/events_2l.c:41: null expansion of name pattern "\1"
ctags: Warning: kernel/locking/lockdep.c:151: null expansion of name pattern "\1"
ctags: Warning: kernel/rcu/rcutorture.c:133: null expansion of name pattern "\1"
ctags: Warning: kernel/rcu/rcutorture.c:135: null expansion of name pattern "\1"
ctags: Warning: kernel/workqueue.c:323: null expansion of name pattern "\1"
ctags: Warning: net/ipv4/syncookies.c:53: null expansion of name pattern "\1"
ctags: Warning: net/ipv6/syncookies.c:44: null expansion of name pattern "\1"
ctags: Warning: net/rds/page.c:45: null expansion of name pattern "\1"

Which are all the result of the DEFINE_PER_CPU pattern:

  scripts/tags.sh:200:	'/\
Acked-by: David S. Miller 
Acked-by: Rafael J. Wysocki 
Cc: Tejun Heo 
Cc: "Paul E. McKenney" 
Signed-off-by: Andrew Morton 
Signed-off-by: Linus Torvalds

workqueue: Fix comment for work_on_cpu()

2016-03-11T17:39:01+00:00

Function is processed in thread context, not in user context.

Cc: Tejun Heo 
Cc: Lai Jiangshan 
Cc: Peter Zijlstra 
Cc: Thomas Gleixner 
Signed-off-by: Anna-Maria Gleixner 
Signed-off-by: Tejun Heo

sched/core: Get rid of 'cpu' argument in wq_worker_sleeping()

2016-03-02T15:28:47+00:00

Given that wq_worker_sleeping() can only be called for the CPU
it is running on, there is no need to pass a CPU ID as an
argument.

Suggested-by: Oleg Nesterov 
Cc: Oleg Nesterov 
Cc: Peter Zijlstra 
Signed-off-by: Alexander Gordeev 
Signed-off-by: Tejun Heo

workqueue: Replace usage of init_name with dev_set_name()

2016-02-17T21:14:18+00:00

The init_name property of the device struct is sort of a hack and should
only be used for statically allocated devices. Since the device is
dynamically allocated here it is safe to use the proper way to set a
device's name by calling dev_set_name().

Signed-off-by: Lars-Peter Clausen 
Signed-off-by: Tejun Heo

workqueue: handle NUMA_NO_NODE for unbound pool_workqueue lookup

2016-02-10T17:13:05+00:00

When looking up the pool_workqueue to use for an unbound workqueue,
workqueue assumes that the target CPU is always bound to a valid NUMA
node.  However, currently, when a CPU goes offline, the mapping is
destroyed and cpu_to_node() returns NUMA_NO_NODE.

This has always been broken but hasn't triggered often enough before
874bbfe600a6 ("workqueue: make sure delayed work run in local cpu").
After the commit, workqueue forcibly assigns the local CPU for
delayed work items without explicit target CPU to fix a different
issue.  This widens the window in which a CPU can go offline while a
delayed work item is pending, so delayed work items can be dispatched
with the target CPU set to an already offlined CPU.  The resulting
NUMA_NO_NODE mapping makes workqueue try to queue the work item on a
NULL pool_workqueue and thus crash.

While 874bbfe600a6 has been reverted for a different reason making the
bug less visible again, it can still happen.  Fix it by mapping
NUMA_NO_NODE to the default pool_workqueue from unbound_pwq_by_node().
This is a temporary workaround.  The long term solution is keeping CPU
-> NODE mapping stable across CPU off/online cycles which is being
worked on.

Signed-off-by: Tejun Heo 
Reported-by: Mike Galbraith 
Cc: Tang Chen 
Cc: Rafael J. Wysocki 
Cc: Len Brown 
Cc: stable@vger.kernel.org
Link: http://lkml.kernel.org/g/1454424264.11183.46.camel@gmail.com
Link: http://lkml.kernel.org/g/1453702100-2597-1-git-send-email-tangchen@cn.fujitsu.com
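
The fallback described above can be sketched in plain userspace C.  This
is a simplified model, not the kernel code: the struct layouts, MAX_NODES
and the function name pwq_by_node() are assumptions made for
illustration, and RCU and locking are left out entirely.

```c
#include <stddef.h>

#define NUMA_NO_NODE (-1)  /* sentinel for "no node", as in the kernel */
#define MAX_NODES 4        /* hypothetical node count for this model */

struct pool_workqueue {
    int id;
};

struct workqueue {
    struct pool_workqueue *dfl_pwq;                  /* default pwq */
    struct pool_workqueue *numa_pwq_tbl[MAX_NODES];  /* per-node pwqs */
};

/*
 * Model of the lookup: an offlined CPU maps to NUMA_NO_NODE, so return
 * the default pwq instead of indexing the per-node table with -1.
 */
static struct pool_workqueue *pwq_by_node(const struct workqueue *wq,
                                          int node)
{
    if (node == NUMA_NO_NODE)
        return wq->dfl_pwq;
    return wq->numa_pwq_tbl[node];
}
```

With this check in place a lookup for an offlined CPU yields the default
pool_workqueue rather than an invalid or NULL entry.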

workqueue: implement "workqueue.debug_force_rr_cpu" debug feature

2016-02-09T22:59:38+00:00

Workqueue used to guarantee local execution for work items queued
without explicit target CPU.  The guarantee is gone now which can
break some usages in subtle ways.  To flush out those cases, this
patch implements a debug feature which forces round-robin CPU
selection for all such work items.

The debug feature defaults to off and can be enabled with a kernel
parameter.  The default can be flipped with a debug config option.

If you hit this commit during bisection, please refer to 041bd12e272c
("Revert "workqueue: make sure delayed work run in local cpu"") for
more information and ping me.

Signed-off-by: Tejun Heo

workqueue: schedule WORK_CPU_UNBOUND work on wq_unbound_cpumask CPUs

2016-02-09T22:59:38+00:00

WORK_CPU_UNBOUND work items queued to a bound workqueue always run
locally.  This is a good thing normally, but not when the user has
asked us to keep unbound work away from certain CPUs.  Round robin
these to wq_unbound_cpumask CPUs instead, as perturbation avoidance
trumps performance.

tj: Cosmetic and comment changes.  WARN_ON_ONCE() dropped from empty
    (wq_unbound_cpumask AND cpu_online_mask).  If we want that, it
    should be done when config changes.

Signed-off-by: Mike Galbraith 
Signed-off-by: Tejun Heo
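
The selection policy can be illustrated with a small userspace model.
It is a sketch under stated assumptions, not the kernel implementation:
CPU masks are plain 32-bit bitmaps, NR_CPUS is fixed at 8, and the names
next_cpu() and select_unbound_cpu() as well as the force_rr parameter
(standing in for the forced round-robin debug switch) are hypothetical.

```c
#include <stdbool.h>
#include <stdint.h>

#define NR_CPUS 8  /* hypothetical CPU count for this model */

/* Next set bit in mask strictly after prev, or NR_CPUS if there is none. */
static int next_cpu(uint32_t mask, int prev)
{
    for (int cpu = prev + 1; cpu < NR_CPUS; cpu++)
        if (mask & (UINT32_C(1) << cpu))
            return cpu;
    return NR_CPUS;
}

/*
 * Pick a CPU for a work item queued from local_cpu without an explicit
 * target.  *last remembers the previous round-robin choice.
 */
static int select_unbound_cpu(int local_cpu, uint32_t unbound_mask,
                              uint32_t online_mask, bool force_rr,
                              int *last)
{
    uint32_t allowed = unbound_mask & online_mask;
    int cpu;

    /* Normal case: stay local when the local CPU is allowed. */
    if (!force_rr && (unbound_mask & (UINT32_C(1) << local_cpu)))
        return local_cpu;

    /* No allowed online CPU: quietly fall back to the local CPU. */
    if (allowed == 0)
        return local_cpu;

    cpu = next_cpu(allowed, *last);
    if (cpu >= NR_CPUS)
        cpu = next_cpu(allowed, -1);  /* wrap around to the first one */
    *last = cpu;
    return cpu;
}
```

In this model, a caller on a CPU outside the mask is spread across the
allowed online CPUs in turn, while a caller on an allowed CPU keeps
running locally unless round-robin is forced.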

Revert "workqueue: make sure delayed work run in local cpu"

2016-02-09T21:11:26+00:00

This reverts commit 874bbfe600a660cba9c776b3957b1ce393151b76.

Workqueue used to implicitly guarantee that work items queued without
explicit CPU specified are put on the local CPU.  Recent changes in
timer broke the guarantee and led to vmstat breakage which was fixed
by 176bed1de5bf ("vmstat: explicitly schedule per-cpu work on the CPU
we need it to run on").

vmstat is the most likely to expose the issue and it's quite possible
that there are other similar problems which are a lot more difficult
to trigger.  As a preventive measure, 874bbfe600a6 ("workqueue: make
sure delayed work run in local cpu") was applied to restore the local
CPU guarantee.  Unfortunately, the change exposed a bug in timer code
which got fixed by 22b886dd1018 ("timers: Use proper base migration in
add_timer_on()").  Due to code restructuring, the commit couldn't be
backported beyond a certain point and stable kernels which only had
874bbfe600a6 started crashing.

The local CPU guarantee was accidental more than anything else and we
want to get rid of it anyway.  As, with the vmstat case fixed,
874bbfe600a6 is causing more problems than it's fixing, it has been
decided to take the chance and officially break the guarantee by
reverting the commit.  A debug feature will be added to force foreign
CPU assignment to expose cases relying on the guarantee and fixes for
the individual cases will be backported to stable as necessary.

Signed-off-by: Tejun Heo 
Fixes: 874bbfe600a6 ("workqueue: make sure delayed work run in local cpu")
Link: http://lkml.kernel.org/g/20160120211926.GJ10810@quack.suse.cz
Cc: stable@vger.kernel.org
Cc: Mike Galbraith 
Cc: Henrique de Moraes Holschuh 
Cc: Daniel Bilik 
Cc: Jan Kara 
Cc: Shaohua Li 
Cc: Sasha Levin 
Cc: Ben Hutchings 
Cc: Thomas Gleixner 
Cc: Jiri Slaby 
Cc: Michal Hocko

workqueue: skip flush dependency checks for legacy workqueues

2016-01-29T18:31:10+00:00

fca839c00a12 ("workqueue: warn if memory reclaim tries to flush
!WQ_MEM_RECLAIM workqueue") implemented flush dependency warning which
triggers if a PF_MEMALLOC task or WQ_MEM_RECLAIM workqueue tries to
flush a !WQ_MEM_RECLAIM workqueue.

This assumes that workqueues marked with WQ_MEM_RECLAIM sit in the
memory reclaim path and that making them depend on something which may
need more memory to make forward progress can lead to deadlocks.
Unfortunately, workqueues created with the legacy create*_workqueue()
interface always have WQ_MEM_RECLAIM regardless of whether memory
reclaim depends on them or not.  These spurious WQ_MEM_RECLAIM markings
cause spurious triggering of the flush dependency checks.

  WARNING: CPU: 0 PID: 6 at kernel/workqueue.c:2361 check_flush_dependency+0x138/0x144()
  workqueue: WQ_MEM_RECLAIM deferwq:deferred_probe_work_func is flushing !WQ_MEM_RECLAIM events:lru_add_drain_per_cpu
  ...
  Workqueue: deferwq deferred_probe_work_func
  [] (unwind_backtrace) from [] (show_stack+0x10/0x14)
  [] (show_stack) from [] (dump_stack+0x94/0xd4)
  [] (dump_stack) from [] (warn_slowpath_common+0x80/0xb0)
  [] (warn_slowpath_common) from [] (warn_slowpath_fmt+0x30/0x40)
  [] (warn_slowpath_fmt) from [] (check_flush_dependency+0x138/0x144)
  [] (check_flush_dependency) from [] (flush_work+0x50/0x15c)
  [] (flush_work) from [] (lru_add_drain_all+0x130/0x180)
  [] (lru_add_drain_all) from [] (migrate_prep+0x8/0x10)
  [] (migrate_prep) from [] (alloc_contig_range+0xd8/0x338)
  [] (alloc_contig_range) from [] (cma_alloc+0xe0/0x1ac)
  [] (cma_alloc) from [] (__alloc_from_contiguous+0x38/0xd8)
  [] (__alloc_from_contiguous) from [] (__dma_alloc+0x240/0x278)
  [] (__dma_alloc) from [] (arm_dma_alloc+0x54/0x5c)
  [] (arm_dma_alloc) from [] (dmam_alloc_coherent+0xc0/0xec)
  [] (dmam_alloc_coherent) from [] (ahci_port_start+0x150/0x1dc)
  [] (ahci_port_start) from [] (ata_host_start.part.3+0xc8/0x1c8)
  [] (ata_host_start.part.3) from [] (ata_host_activate+0x50/0x148)
  [] (ata_host_activate) from [] (ahci_host_activate+0x44/0x114)
  [] (ahci_host_activate) from [] (ahci_platform_init_host+0x1d8/0x3c8)
  [] (ahci_platform_init_host) from [] (tegra_ahci_probe+0x448/0x4e8)
  [] (tegra_ahci_probe) from [] (platform_drv_probe+0x50/0xac)
  [] (platform_drv_probe) from [] (driver_probe_device+0x214/0x2c0)
  [] (driver_probe_device) from [] (bus_for_each_drv+0x60/0x94)
  [] (bus_for_each_drv) from [] (__device_attach+0xb0/0x114)
  [] (__device_attach) from [] (bus_probe_device+0x84/0x8c)
  [] (bus_probe_device) from [] (deferred_probe_work_func+0x68/0x98)
  [] (deferred_probe_work_func) from [] (process_one_work+0x120/0x3f8)
  [] (process_one_work) from [] (worker_thread+0x38/0x55c)
  [] (worker_thread) from [] (kthread+0xdc/0xf4)
  [] (kthread) from [] (ret_from_fork+0x14/0x3c)

Fix it by marking workqueues created via create*_workqueue() with
__WQ_LEGACY and disabling flush dependency checks on them.

Signed-off-by: Tejun Heo 
Reported-and-tested-by: Thierry Reding 
Link: http://lkml.kernel.org/g/20160126173843.GA11115@ulmo.nvidia.com
Fixes: fca839c00a12 ("workqueue: warn if memory reclaim tries to flush !WQ_MEM_RECLAIM workqueue")
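
The effect of the legacy marking on the check can be modeled in a few
lines of userspace C.  This is only a sketch: the MODEL_* flag values
and the function name flush_dependency_warns() are made up for
illustration, and the PF_MEMALLOC task case is left out.

```c
#include <stdbool.h>

/* Hypothetical flag values for this model; the kernel's values differ. */
enum {
    MODEL_WQ_MEM_RECLAIM = 1 << 0,  /* workqueue has a rescuer */
    MODEL_WQ_LEGACY      = 1 << 1,  /* made via create*_workqueue() */
};

/*
 * Returns true if a work item running on a workqueue with flusher_flags
 * would trigger the warning when flushing one with target_flags.
 */
static bool flush_dependency_warns(unsigned int flusher_flags,
                                   unsigned int target_flags)
{
    /* Flushing a WQ_MEM_RECLAIM workqueue is always fine. */
    if (target_flags & MODEL_WQ_MEM_RECLAIM)
        return false;

    /* Warn only if the flusher is reclaim-marked and not legacy. */
    return (flusher_flags & (MODEL_WQ_MEM_RECLAIM | MODEL_WQ_LEGACY)) ==
           MODEL_WQ_MEM_RECLAIM;
}
```

In the model, a legacy workqueue such as deferwq in the trace above
carries both flags, so flushing a !WQ_MEM_RECLAIM workqueue from it no
longer warns.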