NekoDaemon's Blog

Building OpenWrt from Scratch for ARM64 UEFI ACPI VM

Posted on 2022-02-26 Last updated on 2022-02-27

OpenWrt doesn't provide a combined disk image for ARM virtual machines, unlike what they did for x86 VMs. Meanwhile, their official ARM64 kernel release can't boot in UEFI environment. But we can still make it work by compiling it from source and building a disk image manually.

How to resize the root LVM partition of Ubuntu

Posted on 2022-02-26

When we resize the virtual hard disk of a virtual machine or restore a disk image to a larger disk, the free space of the partition detected by Ubuntu will not increase because the partition table is unchanged. In the past, we could easily resize the ext4 root partition with the help of resize2fs. However, things get complex when Ubuntu utilizes LVM partition as their default root partition.

博客年度总结既 Hexo 第三次魔改记录

Posted on 2022-01-31

在除夕前那么几天终于有一点闲空了，早在半年前，我就对原来的博客很不爽了，原来的主题丑到我了，博客系统也废了很久（不过怎么还有人能成功评论了，我自己都不能登录上去）。突然又发现 NexT 主题悄悄换了个仓库，早就更新了一个大版本了，连渲染后端都换成了 Nunjucks 了。总之，是时候爆改我博客的 Remix 主题了。

Unattended Ubuntu 20.04 Server Offline Installation

Posted on 2022-01-16 Last updated on 2022-09-03

Last year, I wrote a post about how to install Ubuntu 18.04 Server automatically. The major reason why I choose to install the older version is I failed to make Ubuntu 20.04 install without pressing any key at that time while the approach for the offline installation recommended by the official is not working.

SC21回顾 - 赢了，但只赢了一点点

Posted on 2022-01-01

SC21又又又又是在线上打的。第二年痛失美帝免费旅游机会了！！！第二年了！！！~~没有机票，酒店，和大吃大喝的比赛能叫比赛吗！~~不过结果还是不错的，远远的超出我的预期（原因请看下文分解）。

Everything you need to know about Splitting NCCL Communicators

Posted on 2021-12-15 Last updated on 2021-12-17

MPI allows to create a new communicator by splitting an existing one into a sub-communicator, which can make our program dynamically select a subset of computing nodes to involve in the collective communication operations, such as all-reduce and all-gather operations. NCCL also has a similar feature, but it is not well-documented yet.

滥用Docker容器当作虚拟机的方法

Posted on 2021-09-06 Last updated on 2021-10-15

把Docker当虚拟机用，虽然真的很不优雅，做出来的镜像又糙又肮脏，但是这真的很方便啊。

个人博客CDN选型和进阶玩法指北

Posted on 2021-08-12

网上几乎所有的文章都直接忽悠上CDN的车，难道上CDN就是提升速度的最优解？在CDN这条弯路上折腾了快两年，玩了一圈免备案的CDN，踩了各种各样的坑以后，恍然大悟，茅厕顿开，便在此大放厥词写下此文。本文主要介绍CDN的正确用法，以及性价比爆炸，便宜又效果好的网站加速方案。

Design of TensorFlow XLA Sharding System

Posted on 2021-08-04 Last updated on 2021-08-05

Recently, a SOTA sharding approach, GSPMD/GShard, was proposed and it provides an intuitive interface to partition a large array on arbitrary dimensions, while utilizing sharding propagation algorithms to automatically infer the partitioning strategy for tensors without user-specified sharding specifications. This document introduces the design and the implementation of XLA Sharding System.

Easy way to debug TensorFlow XLA Compiler using VSCode

Posted on 2021-08-04

It would be easier to read the source code if we are aware of the runtime information, including call stacks and variable values. This tutorial introduces how to utilize our powerful VSCode to trace XLA Compiler.