llm/2ad2a7bb-5462-4391-a2da-bf11064993c9/batch-10-37f9d7a2-1cf8-41e7-8a84-f62f8693e52b-input.json
The following is content for you to classify. Do not respond to the comments—classify them.
<topics>
1. ARC-AGI Benchmark Validity
Related: Debate over whether ARC-AGI measures general intelligence or just spatial reasoning puzzles, concerns about benchmarkmaxxing, semi-private vs private test sets, cost per task at $13.62, and whether solving it indicates anything meaningful about AGI capabilities
2. Gemini vs Claude for Coding
Related: Strong consensus that Claude dominates agentic coding workflows while Gemini lags behind, discussion of tool calling failures, instruction following issues, and hallucinations when using Gemini for development tasks
3. Benchmarkmaxxing Concerns
Related: Skepticism that high benchmark scores reflect real-world performance, suspicions that labs optimize specifically for popular tests, concerns about training data leakage, and debate over whether improvements are genuine or gamed
4. Definition of AGI
Related: Philosophical debate about what constitutes artificial general intelligence, whether consciousness is required, Chollet's definition involving tasks feasible for humans but unsolved by AI, and moving goalposts in AI evaluation
5. Google Product Quality Issues
Related: Complaints about Gemini app UX problems including context loss, Russian propaganda sources, switching languages mid-sentence, document upload failures, and poor integration compared to ChatGPT
6. Balatro Gaming Benchmark
Related: Discussion of Gemini 3's ability to play the card game Balatro from text descriptions alone, debate over whether this demonstrates generalization, and comparisons showing other models like DeepSeek failing at the task
7. Model Release Acceleration
Related: Observation that AI model releases are accelerating dramatically, multiple frontier models released within days, connection to Chinese New Year timing, and competition between US and Chinese labs
8. Cost vs Performance Tradeoffs
Related: Analysis of inference costs versus capabilities, Gemini Flash praised for cost-performance ratio, concerns about $13.62 per ARC-AGI task, and debate over what price makes models practical for real applications
9. Deep Research Reliability
Related: Mixed experiences with AI deep research capabilities, complaints about garbage citations, hallucinated sources, contradictory information, and questions about whether it saves time when sources must be verified
10. Google's Competitive Position
Related: Debate over whether Google is leading or behind in AI, discussion of their data advantages from YouTube and Books, claims they let competitors think they were behind, and analysis of their strengths in visual AI
11. Pelican on Bicycle Benchmark
Related: Simon Willison's informal SVG generation test, discussion of whether it's being trained on specifically, quality improvements in latest models, and debate over its validity as a casual benchmark
12. AI Consciousness Claims
Related: Pushback against suggestions that passing tests indicates consciousness, comparisons to simple programs claiming consciousness, discussion of self-awareness research, and skepticism about anthropomorphizing AI capabilities
13. Test Time Compute Approaches
Related: Analysis of thinking vs non-thinking models, best-of-N approaches like Deep Think, computational complexity differences, and questions about whether sufficiently large non-thinking models can match smaller thinking ones
14. Real World Task Performance
Related: Frustration that benchmark gains don't translate to practical improvements, examples of models failing simple debugging tasks, and arguments that actual work product matters more than test scores
15. AI Job Displacement Fears
Related: Concerns about software engineers being replaced, comparisons to factory worker displacement, debate over whether AI creates or destroys jobs, and skepticism about optimistic narratives from AI company executives
16. Spatial Reasoning Limitations
Related: Discussion of LLMs struggling with spatial tasks, image orientation affecting OCR accuracy, and whether ARC-AGI improvements indicate genuine spatial reasoning advances or benchmark-specific solutions
17. Model Architecture Secrecy
Related: Observation that frontier labs no longer share architecture details like parameter counts, shift from technical discussions to capability-focused marketing, and desire for more transparency
18. Academic vs Practical Intelligence
Related: Distinction between Gemini excelling at academic benchmarks while feeling less useful for practical tasks, discussion of book smart vs street smart analogies for AI capabilities
19. First Proof Mathematical Challenge
Related: Discussion of newly released unsolved math problems designed to test frontier models, predictions about whether current models can solve genuine research-level mathematics
20. Subscription Pricing Frustration
Related: Complaints about $250/month Google AI Ultra subscription required for Deep Think access, desire to test new models without platform lock-in, and calls for OpenRouter availability
0. Does not fit well in any category
</topics>
<comments_to_classify>
[
{
"id": "47001080",
"text": "Wait till we get to the point where we can ask AI to create a better AI."
}
,
{
"id": "47001107",
"text": "Right now I'm still stuck with AI that can't even install other AI."
}
,
{
"id": "46992232",
"text": "Not trained for agentic workflows yet unfortunately - this looks like it will be fantastic when they have an agent friendly one. Super exciting."
}
,
{
"id": "46993123",
"text": "Its really weird how you all are begging to be replaced by llms, you think if agentic workflows get good enough you're going to keep your job? Or not have your salary reduced by 50%?\n\nIf Agents get good enough it's not going to build some profitable startup for you (or whatever people think they're doing with the llm slot machines) because that implies that anyone else with access to that agent can just copy you, its what they're designed to do... launder IP/Copyright. Its weird to see people get excited for this technology.\n\nNone of this good. We are simply going to have our workforces replaced by assets owned by Google, Anthropic and OpenAI. We'll all be fighting for the same barista jobs, or miserable factory jobs. Take note on how all these CEOs are trying to make it sound cool to \"go to trade school\" or how we need \"strong American workers to work in factories\"."
}
,
{
"id": "46995495",
"text": "> Its really weird how you all are begging to be replaced by llms, you think if agentic workflows get good enough you're going to keep your job? Or not have your salary reduced by 50%?\n\nThe computer industry (including SW) has been in the business of replacing jobs for decades - since the 70's. It's only fitting that SW engineers finally become the target."
}
,
{
"id": "47000154",
"text": "Is that really true? Software created an incredible amount of new types of jobs and markets."
}
,
{
"id": "47000052",
"text": "The most gullible workforce ever (FOSS), but seeing Youtube, half the planet is braindead for handing over their craft on a platter for mere dollars."
}
,
{
"id": "46994920",
"text": "I think a lot of people assume they will become highly paid Agent orchestrators or some such. I don't think anyone really knows where things are heading."
}
,
{
"id": "46997674",
"text": "Why would someone get paid well for this skill? Its not valuable at all."
}
,
{
"id": "47000085",
"text": "Highly valuable right now with how high leverage it can make a good engineer. Who knows for how long."
}
,
{
"id": "46993890",
"text": "Most folks don't seem to think that far down the line, or they haven't caught on to the reality that the people who actually make decisions will make the obvious kind of decisions (ex: fire the humans, cut the pay, etc) that they already make."
}
,
{
"id": "46995975",
"text": "they think they're going to be the person making that decision\n\nbut forgot there's likely someone above them making exactly the same one about them"
}
,
{
"id": "46995717",
"text": "I agree with you and have similar thoughts (maybe, unfortunately for me). I personally know people who outsource not just their work, but also their life to LLMs, and reading their exciting comments makes me feel a mix of cringe, fomo and dread. But what is the engame for me and you likes, when we finally would be evicted from our own craft? Stash money while we still can, watching 'world crash and burn', and then go and try to ascend in some other, not yet automated craft?"
}
,
{
"id": "46996095",
"text": "Yeah, that's a good question that I can't stop thinking about. I don't really enjoy much else other than building software, its genuinely my favorite thing to do. Maybe there will be a world where we aren't completely replaced, we have handmade clothes still after all that are highly coveted. I just worry its going to uproot more than just software engineering, theoretically it shouldn't be hard to replace all low hanging fruit in the realm of anything that deals with computer I/O. Previous generations of automation have created new opportunities for humans, but this seems mostly just as a means of replacement. The advent of mass transportation/vehicles created machines who needed mechanics (and eventually software), I don't see that happening in this new paradigm.\n\nI don't think that's going to make society very pleasant if everyone's fighting over the few remaining ways to make livelihood. People need to work to eat. I certainly don't see the capitalist class giving everyone UBI and letting us garden or paint for the rest of our lives. I worry we're likely going to end up in trenches or purged through some other means."
}
,
{
"id": "46999310",
"text": "If you want to know where it's headed, look at factory workers 40 years ago. Lots of people still work at factories today, they just aren't in the same places they were 40 years ago and now req an entirely different skill set.\n\nThe largest ongoing expense of every company is labor and software devs are some of the highest paid labor on the planet. AI will eventually drive down wages for this class of workers most likely by shipping these jobs to people in other countries where labor is much cheaper. Just like factory work did.\n\nEnjoy the good times while they last (or get a job at an AI company)."
}
,
{
"id": "46996459",
"text": "I’m someone who’d like to deploy a lot more workers than I want to manage.\n\nPut another way, I’m on the capital side of the conversation.\n\nThe good news for labor that has experience and creativity is that it just started costing 1/100,000 what it used to to get on that side of the equation."
}
,
{
"id": "46997146",
"text": "If LLMs truly cause widespread replacement of labor, you’re screwed just as much as anyone else. If we hit say 40% unemployment do you think people will care you own your home or not? Do you think people will care you have currency or not? The best case outcome will be universal income and a pseudo utopia where everyone does ok. The “bad” scenario is widespread war.\n\nI am one of the “haves” and am not looking forward to the instability this may bring. Literally no one should."
}
,
{
"id": "46997249",
"text": "> I am one of the “haves” and am not looking forward to the instability this may bring. Literally no one should.\n\nthese people always forget capitalism is permitted to exist by consent of the people\n\nif there's 40% unemployment it won't continue to exist, regardless of what the TV/tiktok/chatgpt says"
}
,
{
"id": "46999980",
"text": "SSSTik is a free TikTok download tool that helps you download TikTok videos without watermark online. ssstt.app Save TT videos with the highest quality in an MP4 file format and HD resolution. To find out how to use the TikTok watermark remover, follow the instructions below The website www.ssstt. app offers a complimentary tool designed to facilitate the download of TikTok videos (previously known as Musically) without watermarks. Users can save TikTok videos in high-definition MP4 format, ensuring optimal quality. To learn how to utilize the TikTok video downloader, please follow the instructions outlined below. The process is straightforward, allowing you to download watermark-free TikTok videos in just three simple steps. Additionally, ssstiktok provides a free service for converting and downloading TikTok audio in MP3 format."
}
,
{
"id": "46997781",
"text": "Well he also thinks $10.00 in LLM tokens is equivalent to a $1mm labor budget. These are the same people who were grifting during the NFTs days, claiming they were the future of art."
}
,
{
"id": "46997667",
"text": "lmao, you are an idealistic moron. If llms can replace labor at 1/100k of the cost (lmfao) why are you looking to \"deploy\" more workers? So are you trying to say if I have $100.00 in tokens I have the equivalent of $10mm in labor potential.... What kind of statement is this?\n\nThis is truly the dumbest statement I've ever seen on this site for too many reasons to list.\n\nYou people sound like NFT people in 2021 telling people that they're creating and redefining art.\n\nOh look [email protected] is a \"web3\" guy. Its all the same grifters from the NFT days behaving the same way."
}
,
{
"id": "47000043",
"text": "But the head honchos on ted.com said AI will create more jobs."
}
,
{
"id": "46995160",
"text": "You don't hate AI, you hate capitalism. All the problems you have listed are not AI issues, its this crappy system where efficiency gains always end up with the capital owners."
}
,
{
"id": "46997952",
"text": "Do we know what model is used by Google Search to generate the AI summary?\n\nI've noticed this week the AI summary now has a loader \"Thinking…\" (no idea if it was already there a few weeks ago). And after \"Thinking…\" it says \"Searching…\" and shows a list of favicons of popular websites (I guess it's generating the list of links on the right side of the AI summary?)."
}
,
{
"id": "46996939",
"text": "Off topic comment (sorry): when people bash \"models that are not their favorite model\" I often wonder if they have done the engineering work to properly use the other models. Different models and architectures often require very different engineering to properly use them. Also, I think it is fine and proper that different developers prefer different models. We are in early days and variety is great."
}
,
{
"id": "46996376",
"text": "I do like google models (and I pay for them), but the lack of competitive agent is a major flaw in Google's offering. It is simply not good enough in comparison to claude code. I wish they put some effort there (as I don't want to pay two subscriptions to both google and anthropic)"
}
,
{
"id": "47000464",
"text": "I tried to debug a Wireguard VPN issue. No luck.\n\nWe need more than AGI."
}
,
{
"id": "46995263",
"text": "I'm really interested in the 3D STL-from-photo process they demo in the video.\n\nNot interested enough to pay $250 to try it out though."
}
,
{
"id": "46998717",
"text": "Is this not yet available for workspace users? I clicked on the Upgrade to Google AI Ultra button on the Gemini app and the page it takes me to still shows Gemini 2.5 Deep Think as an added feature. Wondering if that's just outdated info"
}
,
{
"id": "46993043",
"text": "top 10 elo in codeforces is pretty absurd"
}
,
{
"id": "46996698",
"text": "The benchmark should be: can you ask it to create a profitable business or product and send you the profit?\n\nEverything else is bike shedding."
}
,
{
"id": "46997454",
"text": "So what happens if the AI companies can't make money? I see more and more advances and breakthrough but they are taking in debt and no revenue in sight.\n\nI seem to understand debt is very bad here since they could just sell more shares, but aren't (either valuation is stretched or no buyers).\n\nJust a recession? Something else? Aren't they very very big to fall?\n\nEdit0: Revenue isn't the right word, profit is more correct. Amazon not being profitable fucks with my understanding of buisness. Not an economist."
}
,
{
"id": "46997631",
"text": ">taking in debt and no revenue in sight.\n\nwhich companies don't have revenue? anthropic is at a run rate of 14 billion (up from 9B in December, which was up from 4B in July). Did you mean profit? They expect to be cash flow positive in 2028."
}
,
{
"id": "46998304",
"text": "Yes thank you, mixing my brushes here - I remembered one of the companies having raised over 100b and having about 10b in revenue."
}
,
{
"id": "46997500",
"text": "AI will kill SaaS moats and thus revenue. Anyone can build new SaaS quickly. Lots of competition will lead to marginal profits.\n\nAI will kill advertising. Whatever sits at the top \"pane of glass\" will be able to filter ads out. Personal agents and bots will filter ads out.\n\nAI will kill social media. The internet will fill with spam.\n\nAI models will become commodity. Unless singularity, no frontier model will stay in the lead. There's competition from all angles. They're easy to build, just capital intensive (though this is only because of speed).\n\nAll this leaves is infrastructure."
}
,
{
"id": "46997675",
"text": "Not following some of the jumps here.\n\nAdvertising, how will they kill ads any better than the current cat and mouse games with ad blockers?\n\nSocial Media, how will they kill social media? Probably 80% of the LinkedIn posts are touched by AI (lots of people spend time crafting them, so even if AI doesn't write the whole thing you know they ran the long ones through one) but I'm still reading (ok maybe skimming) the posts."
}
,
{
"id": "46998012",
"text": "> Advertising, how will they kill ads any better than the current cat and mouse games with ad blockers?\n\nThe Ad Blocker cat and mouse game relies on human-written metaheuristics and rules. It's annoying for humans to keep up. It's difficult to install.\n\nAgents/Bots or super slim detection models will easily be able to train on ads and nuke them whatever form they come in: javascript, inline DOM, text content, video content.\n\nTrain an anti-Ad model and it will cleanse the web of ads. You just need a place to run it from the top.\n\nYou wouldn't even have to embed this into a browser. It could run in memory with permissions to overwrite the memory of other applications.\n\n> Social Media, how will they kill social media?\n\nMoltClawd was only the beginning. Soon the signal will become so noisy it will be intolerable. Just this week, X's Nikita Bier suggested we have less than six months before he sees no solution.\n\nSpeaking of X, they just took down Higgsfield's (valued at $1.3B) main account because they were doing it across a molt bot army, and they're not the only ones. Extreme measures were the only thing they could do. For the distributed spam army, there will be no fix. People are already getting phone calls from this stuff."
}
,
{
"id": "46998768",
"text": "> AI will kill SaaS moats and thus revenue. Anyone can build new SaaS quickly.\n\nI'm LLM-positive but for me this is a stretch. Seeing it pop up all over media in the past couple weeks also makes me suspect astrofurfing. Like a few years back when there were a zillion articles saying voice search was the future and nobody used regular web search any more."
}
,
{
"id": "46998721",
"text": "AI models will simply build the ads into the responses, seamlessly. How do you filter out ads when you search for suggestions for products, and the AI companies suggest paid products in the responses?\n\nBased on current laws, does this even have to be disclosed? Will laws be passed to require disclosure?"
}
,
{
"id": "47000965",
"text": "What happens if oil companies can't make money? They will restructure society so they can. That's the essence of capitalism, the willingness to restructure society to chase growth.\n\nObviously this tech is profitable in some world. Car companies can't make money if we live in walking distance and people walk on roads."
}
,
{
"id": "46997531",
"text": "They're using the ride share app playbook. Subsidize the product to reach market saturation. Once you've found a market segment that depends on your product you raise the price to break even. One major difference though is that ride share's haven't really changed in capabilities since they launched: it's a map that shows a little car with your driver coming and a pin where you're going. But it's reasonable to believe that AI will have new fundamental capabilities in the 2030s, 2040s, and so on."
}
,
{
"id": "46994423",
"text": "Praying this isn't another Llama4 situation where the benchmark numbers are cooked. 84.6% on Arc-AGI is incredible!"
}
,
{
"id": "46997276",
"text": "I think I'm finally realizing that my job probably won't exist in 3-5. Things are moving so fast now that the LLMs are basically writing themselves. I think the earlier iterations moved slower because they were limited by human ability and productivity limitations."
}
,
{
"id": "46999928",
"text": "When will AI come up with a cure / vaccine for the common cold? and then cancer next?"
}
,
{
"id": "47000005",
"text": "Race for solving baldness :D"
}
,
{
"id": "47001104",
"text": "Dutasteride already exists for that, been on it almost 10 years soon and it's great. Although if you are already bald it is kind of moot."
}
,
{
"id": "46991708",
"text": "Unfortunately, it's only available in the Ultra subscription if it's available at all."
}
,
{
"id": "46998504",
"text": "Nonsense releases. Until they allow for medical diagnosis and legal advice who cares? You own all the prompts and outputs but somehow they can still modify them and censor them? No.\n\nThese 'Ai' are just sophisticated data collection machines, with the ability to generate meh code."
}
,
{
"id": "46993312",
"text": "Gemini was awesome and now it’s garbage.\n\nIt’s impossible for it to do anything but cut code down, drop features, lose stuff and give you less than the code you put in.\n\nIt’s puzzling because it spent months at the head of the pack now I don’t use it at all because why do I want any of those things when I’m doing development.\n\nI’m a paid subscriber but there’s no point any more I’ll spend the money on Claude 4.6 instead."
}
,
{
"id": "46993485",
"text": "I never found it useful for code. It produced garbage littered with gigantic comments.\n\nMe: Remove comments\n\nLiterally Gemini: // Comments were removed"
}
]
</comments_to_classify>
Based on the comments above, assign each to up to 3 relevant topics.
Return ONLY a JSON array with this exact structure (no other text):
[
{
"id": "comment_id_1",
"topics": [
1,
3,
5
]
}
,
{
"id": "comment_id_2",
"topics": [
2
]
}
,
{
"id": "comment_id_3",
"topics": [
0
]
}
,
...
]
Rules:
- Each comment can have 0 to 3 topics
- Use 1-based topic indices for matches
- Use index 0 if the comment does not fit well in any category
- Only assign topics that are genuinely relevant to the comment
Remember: Output ONLY the JSON array, no other text.
50