Interested in a HN-like source of information and discussions on AI news. Ideally it would include slightly more in-depth and in the weeds discussions on AI research and developments, while staying away from basic news stories and applications.
https://paperswithcode.com/ is arguably the best source and overview of all the research. It's also (somewhat) unbiased, being owned by Meta rather than an SEO-optimised company blog.
A lot of the latest high-performing models aren't making it to HN. I use paperswithcode by scanning major tasks for new models that come out #1 across multiple benchmarks and then reading those papers.
Reminds me of a reddit meme I saw a few weeks ago of a picture of a boxer trying to keep up with the rate of change and getting tired out, then saying "damn, singularity got hands"
The average academic paper is about 12-14 pages including citations.
So on average probably 50 pages an hour.
With practice, that's entirely reasonable.
Without practice it's a lot.
Reading technical literature is a skill that develops over time. At first it goes slow. Then it gets normal. At first it might take an hour to read a 12 page paper. After a few months, fifteen minutes becomes enough to read a paper and make a cup of coffee.
My take goes against most of the other comments here – don't keep up. It's not practical, the amount of new information and development is too much to process.
I have a daily workflow of scanning r/ML and HN and I subscribe to a few newsletters that I came across. I save bookmarks of tools and repos to raindrop.io and articles to readwise/reader. One good trick is to use the readwise feed email when subscribing to newsletters, so the newsletters go to Readwise instead of your personal email.
My big unsolved problem is Twitter — how do I keep my time on Twitter under half an hour a day, ideally by using some type of Twitter-based filter/aggregator?
Labml daily is a relatively good trend aggregator informed by Twitter. But I still keep discovering interesting things on Twitter not covered by any of the above. And BTW I bookmark twitter threads to Readwise/reader as well.
I used to deal with the "avoiding going on Twitter" problem by subscribing to interesting AI Twitter feeds in Feedbin, an RSS aggregator. Unfortunately that doesn't work any more because the "genius" in charge revoked Feedbin's Twitter API access a week or so ago. So now I don't check Twitter at all.
The people posting on Twitter do it to be heard by others. As the audience decreases, the significance of the platform decreases. Thus, people stop posting on said platform and use other avenues to get their voices heard.
As users drop off, advertisers leave, removing a large part of their revenue.
Your claim is so completely out of touch with reality...
The koolaid level is high with you.
No one stops posting on Twitter because Feedbin can't repost. You care about people who engage, not someone who doesn't follow, comment, or show up in the view count.
I think Twitter lists are a good starting point. There are plenty of ML/AI lists around. I start from there and whittle down to a good signal/noise ratio, meaning I avoid people who post overly frequent, fluffy hot takes. People who retweet good stuff are also worth a follow. Ultimately what I want to get out of Twitter is tools, papers, and good blog links.
For specific research-oriented follows I would suggest starting with Ilya Sutskever and seeing who he is following or retweeting. And for practical stuff, start with Simon Willison.
A blog which in the past speculated about Covid and bird flu, and now tells stories around generative methods. I would not classify this as keeping up with AI in the technical sense.
Zvi's updates are very comprehensive and detailed -- and his commentary is excellent, because he combines intellectual curiosity with a careful scrupulousness for factual accuracy. And his mind has some interesting corners.
I find that I always come away learning a thing or two from his updates -- and feel as though I'm keeping up with at least those developments which relate to commercially-available AI.
His blog is not a repository of scientific work like arXiv, but more of a curated summary of AI news. It is, after all, a blog.
"hijacking" the post to ask where I can find a good introduction to machine learning and AI. Not how to use this or this library but the fundamentals and principles behind. Preferably something explaining clearly the principles first then explaining the maths (from the beginning, my maths are quite far now) then showing practical usage/development (in any high level language like python or julia). I do not need to jump straight to the latest algorithms, I prefer starting with building bricks first
The usual recommendations, I think, are: Andrew Ng's Coursera course for the fundamentals, Andrej Karpathy's videos (https://karpathy.ai/zero-to-hero.html) for something more practical and LLM-focused, and also Fast.ai's courses. I've done some of the first two and they seem great.
The Little Learner[0]: Learn the basics using Scheme. This is fundamental; no frameworks, just Scheme and functions. From here you can go anywhere.
Then watch Neural Networks: Zero to Hero[1] to learn how autograd works, learn a few advanced topics, and most importantly, to see what the workflow of an expert looks like.
Then read PyTorch tutorials and the PyTorch docs while doing some real projects.
If PyTorch starts to feel too rigid, then look into Jax or Julia's Flux library.
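If it helps to see where this path leads, here is a minimal sketch of the standard PyTorch training loop; the data and tiny model below are made up purely for illustration:

```python
# Minimal PyTorch training-loop sketch (toy data, toy model; illustrative only).
import torch
import torch.nn as nn

# Toy data: 256 samples, 10 features, binary labels.
X = torch.randn(256, 10)
y = torch.randint(0, 2, (256,))

model = nn.Sequential(nn.Linear(10, 32), nn.ReLU(), nn.Linear(32, 2))
opt = torch.optim.Adam(model.parameters(), lr=1e-3)
loss_fn = nn.CrossEntropyLoss()

for epoch in range(5):
    opt.zero_grad()              # reset gradients from the previous step
    loss = loss_fn(model(X), y)  # forward pass + loss
    loss.backward()              # backprop via autograd (the Zero to Hero material)
    opt.step()                   # gradient-descent update
    print(epoch, loss.item())
```

Every "real project" is essentially this loop with a proper dataset, model, and evaluation wrapped around it.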
Karpathy recently tweeted that we don't need more researchers, we need prompt engineers. That goes beyond just writing prompts to actually writing code that makes AI agents carry out multiple tasks; AutoGPT, for instance, is kind of amazing. I think if you built something like that you'd get job offers from AI labs. They're all moving from research to products, meaning they need people to create use cases around their new products.
Is it just me, or is this post complete nonsense? This AI hype seems to be propagated by people who have never programmed more than 1 kloc in their lives. But maybe that's the point?
For example:
> “babyagi” is a program that given a task, creates a task list and executes the tasks over and over again. It’s now been open sourced and is the top trending repos on Github atm [Link]. Helpful tip on running it locally [Link]
The babyagi project is an extremely simple 180-line Python script. The tips for running it just rephrase the readme: set some environment variables.
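For anyone curious, the pattern those 180 lines implement boils down to roughly the following. This is my own paraphrase, not the actual babyagi code, and `call_llm` is a stand-in for whatever completion API you'd plug in:

```python
# Rough paraphrase of the "task list + execute in a loop" pattern; not babyagi's real code.
from collections import deque

def call_llm(prompt: str) -> str:
    """Stand-in for an LLM completion call (hypothetical)."""
    raise NotImplementedError

def agent_loop(objective: str, max_steps: int = 10) -> None:
    tasks = deque([f"Make a plan to achieve: {objective}"])
    for _ in range(max_steps):
        if not tasks:
            break
        task = tasks.popleft()
        result = call_llm(f"Objective: {objective}\nTask: {task}\nDo the task.")
        # Ask the model for follow-up tasks and push them onto the queue.
        new_tasks = call_llm(
            f"Objective: {objective}\nLast result: {result}\nList new tasks, one per line."
        )
        tasks.extend(t.strip() for t in new_tasks.splitlines() if t.strip())
```

That's the whole trick: a task queue and two prompts in a loop.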
Lots of sources. However, Last Week in AI has been a great podcast since I started listening a couple of months ago; they quickly summarize and discuss papers and news. As with Covid, beware of resources that only started covering AI because it's trendy lately.
For years I have followed top researchers on Twitter, and it has helped quite a bit to stay up to date on the topic. Today I think it's still quite good for that purpose, despite the countless ways that Musk is trying to make it worse...
HN sits on the Early Majority border of the innovation adoption curve. It's also highly resistant to new tech of all sorts and tries to bury it.
HN is okay as a backstop for areas you don't watch closely, so you don't miss out on anything cool. It used to be 1-2 years behind the curve in the GPT-3 era, but now that things are moving faster, it's only around 3 months or so behind.
While we are on the topic, can somebody give a TLDR of what breakthroughs made the current AI advancements possible? From what I understand, the "foundation" is exactly the same as it was 40 years ago: same neural networks, same activation functions, same architectures, same gradient descent. If I ask the "skeptical" crowd, they say: "nothing is new, we just started using GPUs". Some say there were breakthroughs in learning algorithms to facilitate deep learning (i.e. that features are learned automatically by deeper layers). Can someone elaborate on this, please? I tried googling and I only get crap articles that just "wave hands".
I studied machine learning 20 years ago, and have only been a casual observer since. I hope someone else corrects me where I'm wrong.
I'd say the big change was first deep nets, mostly CNNs. 20 years ago there was a lot of talk about fully connected hidden layers, and how a multi-layer net is equivalent to an ANN with one hidden layer if you allow exponential scaling of the number of perceptrons. They took forever to converge. CNNs reduce the number of parameters, and hence the training time, by quite a lot. They also make it obvious that even if multi-hidden-layer nets are theoretically equivalent, in practice you can cheat a lot using convolutions, making that theory pointless.
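To put a rough number on that parameter reduction, here's a quick PyTorch comparison on a 3x32x32 input (purely illustrative):

```python
# Compare parameter counts: a fully connected layer vs. a convolution over the same input size.
import torch.nn as nn

def n_params(m):
    return sum(p.numel() for p in m.parameters())

# Input: a 3x32x32 image (3072 values), output kept at the same size.
fc = nn.Linear(3 * 32 * 32, 3 * 32 * 32)         # dense: every input connects to every output
conv = nn.Conv2d(3, 3, kernel_size=3, padding=1)  # conv: small shared 3x3 kernels

print(n_params(fc))    # 9440256
print(n_params(conv))  # 84
```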
Then came embeddings that segmented training into an embedding stage and a prediction stage, further reducing training time.
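A tiny sketch of that two-stage split, with random vectors standing in for pre-trained word embeddings (word2vec/GloVe style):

```python
# Two-stage idea: reuse a pre-trained embedding table, train only the predictor on top.
import torch
import torch.nn as nn

vocab_size, dim = 10_000, 50
pretrained = torch.randn(vocab_size, dim)                     # stand-in for word2vec/GloVe vectors
emb = nn.Embedding.from_pretrained(pretrained, freeze=True)   # embedding stage, not trained here
clf = nn.Linear(dim, 2)                                       # prediction stage, trained on the task

tokens = torch.tensor([[1, 5, 42]])                           # a toy 3-token "sentence"
logits = clf(emb(tokens).mean(dim=1))                         # average word vectors, then classify
```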
RNNs never really took off as a name, and seem to have been replaced by LSTMs, a subset of the vague RNN term. Having a well-understood way of introducing memory is very useful for sequences, and makes scaling a model to sequences much cheaper. So, yet another input-size-reducing thing. (I guess we're basically building Markov processes on top of ANNs now? When I was in school, it was an either-or choice.)
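For reference, a minimal LSTM-over-a-sequence sketch in PyTorch; the shapes are the whole point here:

```python
# Minimal LSTM over a sequence: the hidden state carries "memory" across time steps.
import torch
import torch.nn as nn

lstm = nn.LSTM(input_size=8, hidden_size=16, batch_first=True)
x = torch.randn(4, 20, 8)    # batch of 4 sequences, 20 steps, 8 features each
out, (h, c) = lstm(x)        # out: per-step outputs; (h, c): final hidden/cell state
print(out.shape, h.shape)    # torch.Size([4, 20, 16]) torch.Size([1, 4, 16])
```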
On a meta-level, there are ensemble models, which is just taking many models, running them in parallel and combining the outputs. Simple idea, but apparently effective enough that people keep doing it. :)
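A sketch of that idea, with untrained linear models standing in for separately trained ones:

```python
# Ensemble sketch: average the predictions of several independently trained models.
import torch
import torch.nn as nn

models = [nn.Linear(10, 2) for _ in range(5)]    # stand-ins for separately trained models
x = torch.randn(1, 10)
avg_logits = torch.stack([m(x) for m in models]).mean(dim=0)
prediction = avg_logits.argmax(dim=1)            # class chosen by the averaged ensemble
```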
Then there was attention being added, which adds a stage where you find similarities between entities. This means the ANN doesn't have to figure out the clustering itself, which again speeds up convergence. (I think it also makes the models more modular, but I'm not sure.)
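The similarity stage is small enough to write out. A minimal scaled dot-product attention sketch, here as self-attention over 5 toy tokens:

```python
# Scaled dot-product attention: scores are similarities between queries and keys.
import math
import torch

def attention(q, k, v):
    scores = q @ k.transpose(-2, -1) / math.sqrt(q.size(-1))  # pairwise similarity
    weights = scores.softmax(dim=-1)                           # normalise to a distribution
    return weights @ v                                         # weighted mix of values

q = k = v = torch.randn(1, 5, 16)   # self-attention: 5 tokens, 16 dims each
out = attention(q, k, v)            # shape (1, 5, 16)
```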
Reinforcement learning (Q-learning, from the late 1980s) arguably benefits tremendously from all the improvements to the ANN models, so I think time will tell how awesome this will be. For now, it seems we're all amazed by how much just the ANN (the Q-function) can do on its own.
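A stripped-down sketch of the "Q-function as a neural net" idea (DQN-style), reduced to a single made-up transition and one update step:

```python
# One temporal-difference update with a neural-net Q-function (toy transition, illustrative only).
import torch
import torch.nn as nn

n_states, n_actions, gamma = 4, 2, 0.99
q_net = nn.Sequential(nn.Linear(n_states, 32), nn.ReLU(), nn.Linear(32, n_actions))
opt = torch.optim.Adam(q_net.parameters(), lr=1e-3)

# One (state, action, reward, next_state) transition, made up for illustration.
s, a, r, s_next = torch.randn(n_states), 1, 1.0, torch.randn(n_states)

target = r + gamma * q_net(s_next).max().detach()   # bootstrap from the net's own estimate
loss = (q_net(s)[a] - target) ** 2                  # temporal-difference error
opt.zero_grad(); loss.backward(); opt.step()        # standard gradient step on the Q-net
```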
All of this is enabled by vastly improved matrix arithmetic hardware, software libraries and RAM prices. (Aside from each of these techniques just reducing training time/size.)
And then there's the inevitable hype bubble that just turns it all up to 11 for a while, because people with money want to invest and people without money want to pretend to know things. I wouldn't discount the availability of money as a "news" here, though.
Sure, I can provide a brief overview of the key breakthroughs and advancements that have contributed to the current state of AI, particularly in the domain of deep learning.
1. Availability of data: The explosion of digital data, especially from the internet, has provided a massive amount of training data for AI models. This has allowed AI systems to learn patterns, features, and representations from various data sources more effectively than before.
2. Hardware improvements: The introduction of GPUs (Graphics Processing Units) and specialized hardware, like TPUs (Tensor Processing Units), has significantly accelerated the training of large neural networks. These advancements enable researchers to experiment with larger and more complex models, leading to improved performance.
3. Algorithmic innovations: Key algorithmic advancements have been made to train deep neural networks more efficiently. Some notable examples include:
a. Backpropagation: This algorithm is used to train neural networks by minimizing the loss function through gradient descent. Although it was introduced in the 1980s, it became more widely used and optimized in recent years.
b. Activation functions: Non-linear activation functions like ReLU (Rectified Linear Unit) have been crucial in addressing the vanishing gradient problem and improving training efficiency in deep networks.
c. Dropout: This regularization technique helps prevent overfitting by randomly dropping out neurons during training, encouraging the network to learn more robust features.
4. Architectural advancements: The development of various neural network architectures has led to improved performance in specific tasks. Some prominent architectures include:
a. Convolutional Neural Networks (CNNs): These networks are especially effective at image recognition tasks due to their ability to capture spatial patterns and hierarchical features.
b. Recurrent Neural Networks (RNNs) and Long Short-Term Memory (LSTM): These architectures excel at handling sequence data, such as time series or natural language processing tasks.
c. Transformers: Introduced in 2017, the transformer architecture has become a key component in state-of-the-art natural language processing models like BERT and GPT, due to its self-attention mechanism and ability to handle long-range dependencies.
5. Transfer learning and pre-training: Instead of training models from scratch, researchers have found it effective to pre-train models on large datasets, followed by fine-tuning them on specific tasks. This approach reduces training time, requires less labeled data, and often leads to better performance.
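A minimal sketch of this pre-train/fine-tune pattern, assuming a recent torchvision and its ImageNet-pretrained ResNet-18 as the backbone:

```python
# Transfer-learning sketch: reuse a pretrained backbone, train only a new task-specific head.
import torch
import torch.nn as nn
from torchvision import models

model = models.resnet18(weights="IMAGENET1K_V1")   # backbone pre-trained on a large dataset
for p in model.parameters():
    p.requires_grad = False                        # freeze the pre-trained features

model.fc = nn.Linear(model.fc.in_features, 10)     # fresh head for a 10-class target task
optimizer = torch.optim.Adam(model.fc.parameters(), lr=1e-3)  # fine-tune only the new head
```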
These breakthroughs and advancements, combined with a growing research community and increased investment in AI, have led to the current state of AI, where deep learning models can achieve human-level or near-human-level performance on a variety of tasks.
Various Discord channels, if you want the latest. As much as I hate Discord's UI and ecosystem, its value for up-to-date information about AI can't be matched.