深度剖析JUC中LongAdder类源码

文章目录

- 1.诞生背景
- 2.LongAdder核心思想
- 3.底层实现：
- 4.额外补充

1.诞生背景

LongAdder是JDK8新增的一个原子操作类，和AtomicLong扮演者同样的角色，由于采用AtomicLong 保证多线程数据同步，高并发场景下会导致大量线程同时竞争更新一个原子变量，容易造成大量线程竞争失败后，无线循环不断自旋尝试CAS，极大浪费CPU资源。为了解决这个循环自旋尝试CAS极大占用CPU资源的问题，JDK大佬就创造了LongAdder类

2.LongAdder核心思想

将一个变量拆分成多个变量，高并发场景下让多个线程竞争获取多个资源，用以减少竞争资源冲突，从而提升性能。

本质就是一种分段锁思想，将一个变量分成多段，多线程并发下获取不同分段对象cell不会发生竞争，有效避免大量线程自旋竞争CAS

3.底层实现：

下面结合LongAdder的结构,add() 和sum() 方法对类底层执行进行剖析。
LongAdder是继承于Striped64。其中比较重要的四个参数在下图列出。

// 继承Striped64.
public class LongAdder extends Striped64 implements Serializable {private static final long serialVersionUID = 7249069246863182397L;
}
// Striped64 类中四个实例变量/** Number of CPUS, to place bound on table size */// 当前机器CPU数目static final int NCPU = Runtime.getRuntime().availableProcessors();/*** Table of cells. When non-null, size is a power of 2.*/// 用于存储变量的数组,初始size为2,采用2倍扩容,因为length参与了线程获取cell对象时索引计算transient volatile Cell[] cells;/*** Base value, used mainly when there is no contention, but also as* a fallback during table initialization races. Updated via CAS.*/// 变量基数transient volatile long base;/*** Spinlock (locked via CAS) used when resizing and/or creating Cells.*/// 用于判断是否发生cell竞争状态标识，1表示存在获取cell线程。transient volatile int cellsBusy;

cells数组是用来存储变量值的一部分的集合。Cell结构如下：以JDK21为例子

    /*** Padded variant of AtomicLong supporting only raw accesses plus CAS.** JVM intrinsics note: It would be possible to use a release-only* form of CAS here, if it were provided.*/@jdk.internal.vm.annotation.Contended static final class Cell {volatile long value;Cell(long x) { value = x; }final boolean cas(long cmp, long val) {return VALUE.weakCompareAndSetRelease(this, cmp, val);}final void reset() {VALUE.setVolatile(this, 0L);}final void reset(long identity) {VALUE.setVolatile(this, identity);}final long getAndSet(long val) {return (long)VALUE.getAndSet(this, val);}// VarHandle mechanicsprivate static final VarHandle VALUE;static {try {MethodHandles.Lookup l = MethodHandles.lookup();VALUE = l.findVarHandle(Cell.class, "value", long.class);} catch (ReflectiveOperationException e) {throw new ExceptionInInitializerError(e);}}}

@jdk.internal.vm.annotation.Contended 注解是用以字节填充，用来避免伪共享。这个伪共享在上一篇剖析AQS源码中有讲。点击查看

这个Cell结构很简单，就是使用value变量来存储值。

接着看看sum() 方法：

    /*** Returns the current sum.  The returned value is <em>NOT</em> an* atomic snapshot; invocation in the absence of concurrent* updates returns an accurate result, but concurrent updates that* occur while the sum is being calculated might not be* incorporated.** @return the sum*/public long sum() {Cell[] cs = cells;long sum = base;if (cs != null) {for (Cell c : cs)if (c != null)sum += c.value;}return sum;}

代码逻辑很简单，就是把base的值和cells数组里的值求和，这个就是LongAdder实际值。

继续看add() 方法,这个是整个类最关键的方法：

    /*** Adds the given value.** @param x the value to add*/public void add(long x) {// b为基础值, v为存储当前线程被分配到具体某个cell的value。/** m 当前cell的长度-1 ，由于cell的长度是2的幂数,因此结构必然是`...1111`结尾,index & m 就是用来计算当前线程竞争获取对象cell在cells的位置!(uncontended = c.cas(v = c.value, v + x) cell对象cas失败走longAccumulate(x, null, uncontended, index)逻辑。**/Cell[] cs; long b, v; int m; Cell c;if ((cs = cells) != null || !casBase(b = base, b + x)) {int index = getProbe();boolean uncontended = true;if (cs == null || (m = cs.length - 1) < 0 ||(c = cs[index & m]) == null ||!(uncontended = c.cas(v = c.value, v + x)))longAccumulate(x, null, uncontended, index);}}/*** Returns the probe value for the current thread.* Duplicated from ThreadLocalRandom because of packaging restrictions.*/static final int getProbe() {return (int) THREAD_PROBE.get(Thread.currentThread());}

1.介绍add方法前，先对getProbe()方法进行简要说明，这个可以理解为根据当前线程获取一个唯一id用来计算当前线程参与竞争Cell对象在cells数组中索引位置。
2.假定一个线程就是一个用户，cells中是一个窗口服务列表，cells中的每个cell实例是一个窗口，相关运行流程图如下：

在这里插入图片描述
3.接着来分析下longAccumulate 这个方法：

final void longAccumulate(long x, LongBinaryOperator fn,boolean wasUncontended, int index) {if (index == 0) {ThreadLocalRandom.current(); // force initializationindex = getProbe();wasUncontended = true;}// 默认冲突为falsefor (boolean collide = false;;) {       // True if last slot nonemptyCell[] cs; Cell c; int n; long v;if ((cs = cells) != null && (n = cs.length) > 0) {// 当前cell对象为空if ((c = cs[(n - 1) & index]) == null) {// cellsBusy 为0 此时可以通过自旋竞争获取锁。if (cellsBusy == 0) {       // Try to attach new CellCell r = new Cell(x);   // Optimistically create// cas方式加锁。这个锁在创建对象和扩容时需要加锁。if (cellsBusy == 0 && casCellsBusy()) {try {               // Recheck under lockCell[] rs; int m, j;// 二次检查。确保对象索引位置为null在执行赋值操作。// 这个和懒加载单例模式DoubleCheck 思想一直。if ((rs = cells) != null &&(m = rs.length) > 0 &&rs[j = (m - 1) & index] == null) {//  将新建的cell对象r赋值给指定位置。rs[j] = r;break;}} finally {// 锁释放cellsBusy = 0;}continue;           // Slot is now non-empty}}collide = false;}// cas执行失败 设置true重新再执行else if (!wasUncontended)       // CAS already known to failwasUncontended = true;      // Continue after rehash// 当前cell 存在 则执行CAS，如果方法fn为null则执行加法操作此时就是LongAdder// 如果传了函数,则调用自定义函数fn。else if (c.cas(v = c.value,(fn == null) ? v + x : fn.applyAsLong(v, x)))break;//  如果当前Cell数组元素个数大于CPU数或者已经完成扩容，则冲突为false。else if (n >= NCPU || cells != cs)collide = false;            // At max size or stale// 如果以上判断均不满足，则是存在冲突的。设置为true。else if (!collide)collide = true;// 如果当前n没有到达cpu个数且存在冲突。// 尝试扩容。cas机制扩容加锁，避免多个线程都进行扩容操作。else if (cellsBusy == 0 && casCellsBusy()) {try {if (cells == cs)        // Expand table unless stalecells = Arrays.copyOf(cs, n << 1); // 必须是2倍扩容} finally {// 锁释放cellsBusy = 0;}// 重新设置冲突为false。collide = false;continue;                   // Retry with expanded table}index = advanceProbe(index);}// cell数组初始化判断逻辑。else if (cellsBusy == 0 && cells == cs && casCellsBusy()) {try {                           // Initialize tableif (cells == cs) {Cell[] rs = new Cell[2];rs[index & 1] = new Cell(x);cells = rs;break;}} finally {cellsBusy = 0;}}// Fall back on using baseelse if (casBase(v = base,(fn == null) ? v + x : fn.applyAsLong(v, x)))break;}}

index ==0 说明 getProbe() 方法为0，(int)
THREAD_PROBE.get(Thread.currentThread())=0,
则说明当前线程threadLocalRandomProbe=0, 这个是通过反射实现值获取。

在这里插入图片描述
再看Thread类中，关于threadLocalRandomProbe注释，说明ThreadLocalRandom 对象没有初始化，因此才需执行初始化操作。也就是ThreadLocalRandom.current()这个方法。

    /** Probe hash value; nonzero if threadLocalRandomSeed initialized */// 初始化后不能为0。int threadLocalRandomProbe;

由于longAccumulate 代码量太大，关于运行情况，注释都写在代码上。相信大家都能看懂。

4.额外补充

1.LongAdder 和LongAccumulate 之间的关系，longAccumulate 是通用累计计算器，不仅可以实现累加，还可以根据用户自定义函数来实现累计功能，LongAdder 是其中一个特例，相当于就是一个longAccumulate 默认实现。
2.Cell数组的个数和CPU相等，此时性能能得到最大发挥。
3.Cell数组占用内存相对较大，一开始是null，只有在使用到Cell数组才会创建，惰性加载/懒加载方式。
4。应用场景：适用于高并发累计统计计数场景，不适用于单线程、以及多线程下实时获取精准数据的情况。
核心思想分段锁，采用空间换时间的策略，来提升高并发下统计计数的效率。大多数性能优化，都是空间换时间、时间换空间这两者之间做权衡。