Unstructured Reading Notes

  • Intelligent, Robust and Trustworthy AI: Managing GenAI Challenges, Next Phase of Hybrid AI Models and Enterprise AI for Mission-Critical Applications, Amit Sheth
    • difficult to detect if text generated
    • RLHF reduce hallucination
    • tweak model to avoid hateful generation
    • neurosymbolic: neural AI combined w/ symbolic cognition AI
      • knowledge graph auxiliary to DL, bring explainability
      • deep infusion
    • what do you have what other do not
  • Scalable -Means Clustering for Large via Seeded Approximate Nearest-Neighbor Search, Jack Spalding-Jamieson
    • k-nearest neighbor w/ approximate radius
      • metric: recall; impossible to guarantee
      • ∃ sublinear heuristic
      • search graph almost universally best
      • other application: quantization, space partitioning
      • initialization does not matter if and large, running Lloyds
    • -means do nearest neighbor search each iteration
    • hierarchical navigable small world (HNSW) perform well
      • start w/ approximate nearest neighbor graph of centroid
      • prune edge to sparsify; randomly sample centroid for higher layer
      • start from highest layer, do greedy local beam search to find best centroid on each layer
      • “seed point” to start search: choosing from previous assignment give big speedup
  • Full Proportional Justified Representation, Jiasen Liu
    • representative selection: majority vs cohesion
    • approval-based multi-winner election
    • justified representation (JR): large cohesive group should have a winner
      • large cohesive group: portion deserve a winner, has common approval
      • but not fair; too weak
    • strong cohesiveness (-cohesive): portion deserve winner, has common approval of candidate
    • weak cohesiveness (weakly -cohesive), common approval of candidate; stronger guarantee
    • non-transferable vs transferable (collective) utility: some voter in group satisfied (stronger) vs all
    • 4 combination of (strong/weak cohesiveness) × (non-transferable/transferable utility)
    • harder (coNP) to verify than finding one solution
    • priceability: voter pay for candidate they approve under constant budget
      • priceability imply FPJR (full proportional justified representation, weak cohesion + transferable utility) by bounding
  • What uniform symmetric distro can a shallow circuit produce, Kewen Wu
    • circuit: combination of logic gate mapping input bit to output bit
    • symmetric: output w/ same Hamming weight are equally likely
    • motivation: compute parity (or something else) of bit w/ shallow circuit → distro sample
      • parity (xor): only e.g. where computation hard but sampling easy
    • impossible to read 1 bit to decode compressed data
    • only 6 case of shallow circuit (local function) mapping from uniform distro to uniform distro
      • equal to at most 2-local
      • cannot output 1 only on 1/3 majority bc deviation from 1/3
      • cannot do majority bc lack of cutoff among either independent group (neighborhood) or independent bit
  • Defense: Efficiency in Privacy-Preserving Computation via Domain Knowledge, Weizhao Jin (advisor: Srivatsan Ravi)
    • homomorphic encryption on selective model parameters (vis mask) to reduce overhead
      • filter by commonly-used formula for parameter sensitivity
      • optimize privacy budget, computed via integration
    • entity resolution: dedup point in 2 dataset
    • simulate ReLU via polynomial function bc HE only support + ×
      • balance degree (slow) and accuracy
    • network path validation: avoid sending info on all node
      • backward propagation to validate path for forward speed assuming few malicious
      • XOR hash w/ zero-knowledge proof each pair
  • I Slept With 100 Men in One Day | Documentary
    • many man would do anything to have sex
    • OnlyFans is a huge money printer but also a slippery slope
  • The Creator Of Elixir - Top Shelf 7
    • Go has huge foot gun in exchange for ability to write lower level code
    • most problem are solved; should just use library
    • Jose Valim see Rust as functional bc explicit mutability
    • Jose think loop hard in immutable language
    • Jose was bitten by PL paper w/ slow implementation, and still does not understand the math in his paper as a coauthor
    • library that cannot evolve w/o breaking backward compatibility is either a lie or poorly designed
  • Credit card interest and interchange fees
    • credit card companies make most money from charging merchant percentage
    • ordinary people pay for the benefit high-end card user get
  • Saagar Enjeti: Trump, MAGA, DOGE, Obama, FDR, JFK, History & Politics | Lex Fridman Podcast #454
    • America spirit highly influenced by Scottish-Irish about not trusting authority, fighting, freedom
    • performance as politician is not very correlated with personal life
    • Washington DC is itself a creature not caring about even the president
    • personnel selection is key bc many decision are made by them
    • lower down can brick president order by delaying
    • US immigration is crowded by relative of citizen
    • managerial revolution (virus): not known what is going on lower down
    • true power show when ruling w/o having the position
    • mainstream media ask dummy question at white house
    • stealing election was common and unhinged in US in the last century
  • Dario Amodei: Anthropic CEO on Claude, AGI & the Future of AI & Humanity | Lex Fridman Podcast #452
    • scaling law: model do better w/ bigger network & training time & data; not seem to stop
    • model learn local pattern, larger scale pattern, then long range correlation, etc. as they grow
    • model feeling dumber likely bc different system prompt, user prompt, luck, or perception
    • hard to make model both helpful/polite and concise/safe
    • targeted regulation is needed to not have people radically against regulation
    • mechanistic interpretability is a green field in AI
    • constitutional AI for RL: AI to rate AI response based on prompt about principle
    • important thing is what ultra good AI can do
    • Amanda Askell is likely the human who talked most to Claude
    • goal is for Claude to be helpful in front of million of people
    • model try to please as many people by default, instead to get best result, encourage it to not care
    • clear prompting: explain concept to model when talking about what you want, w/ example
    • anthropomorphize w/ model: can ask why model refuse to answer
    • use strong word to make model do something, e.g., never ever vs do not
    • not failing signal failure bc not trying hard enough, but need to understand high-risk failure
    • interesting idea: model refuse to talk if boring and just quit
    • best way to measure model ability is bleeding edge research you do
  • Defense: Real-time Multi-Resolution Neural Networks for Hand Simulation, Mianlun Zheng (Advisor: Jernej Barbič)
    • from joint angle, simulate bone, tendon, muscle, vein, nerve, fascia, …
    • application: medical education, robotics, Metaverse, physical therapy (committee)
    • average (interpolate) 6 MRI scan of hand pose by calculating plastic strain & min energy function for equilibrium
    • pattern matching bad: need huge data; no anatomy (dumb)
    • tendon simulation: rod model; attach bone; emulate force from forearm; hook → tunnel to slide through
    • fascia: triangle mesh + cloth solver; fat: FEM simulation; nail: rigid point cloud in skin
    • high accuracy: < 1mm error against MRI
    • real-time neural net challenge: nets in graphics slow; real-world usage need < 1ms; existing libraries overlook small model
    • need: change level of detail (LOD) for Metaverse
    • animation: deform character to vertices
    • joint angle → linear blend skinning → net to reduce residue → real-time output
    • LOD support: more vertex each level; restriction vs prolongation to downsample & upsample
      • go up & down by level
      • each level each region (by unity construction) separate net → small net → efficient
    • further performance: reduce joint by perturbation simulation; PCA reduction for each region
    • hot cache performance: lower output dimension & faster than prior work; much faster on high level
  • Measuring context switching and memory overheads for Linux threads
    • Native POSIX Thread Library (NPTL) & futex several times faster than old Linux thread
    • bearable latency: on his i7-4771 in 2018, multicore context switching took ~1.5µs, same core switching ~2.5µs, launching thread ~5µs, launching process ~22.5µs
      comparison: memcpy 64 KiB took 3µs, Goroutine switching took 170ns
    • lazy memory: 8 MiB virtual via ulimit (VM), ~8 KiB resident (RSS) without touching stack
  • Optimal Protocols for Studying & Learning, Andrew Huberman
    • neuroplasticity apply during fast eye movement sleep
    • focus & alertness is how remember; sleep is prerequisite; meditation, NSDR are tool
    • testing help remember; test ASAP after exposure
      • best be open-ended, short answer, minimal prompt, require critical thinking
      • do not matter for memory if wrong
    • perceived familiarity hinder memory
    • gap effect: pause in talk greatly reinforce memory
  • ChatGPT from Scratch: How to Train an Enterprise AI Assistant • Phil Winder • GOTO 2023
    • post-GPT are decoder-only
    • logistic regression is often superb for domain-specific task
    • RLHF degrade performance
    • parameter-efficient fine-tuning (PEFT): use adapter alongside frozen model
    • quantization massively reduce RAM usage
    • Falcon-7B LLM: used in demo, said to have cleaned training data, open-source
    • fine tuning demo done on V100 in Colab
  • How Smartphones Shrink Our Brains
    • anticipation trump focus
    • habit to rely on tool kill memory
    • trying to split attention make though shallow
    • paper is better for thinking (I think large screen help)