Hacker News posts about Data Commons
- Data Commons (datacommons.org)
- Tragedy of the (Data) Commons (stackoverflow.blog)
- Show HN: Nova JavaScript Engine (github.com)
- Show HN: Dribdat – a honeycomb challenge board for sweeter hackathons (demo.dribdat.cc)
- Show HN: Tab Sync Pro – Automatically organize browser tabs by domain (chromewebstore.google.com)
- Show HN: GitHub Repo Converter – Transform Any Repo into LLM-Optimized Format (www.gitdevtool.com)
- Show HN: Interactive price comparison for AI models (LLMs and Speech-to-Text) (ai-pricing.vercel.app)
- Becoming Data Driven, from First Principles (commoncog.com)
- Show HN: We open-sourced our compost monitoring tech (github.com)
- Show HN: Fireproof – local-first database with Git-like encrypted sync (fireproof.storage)
- Show HN: Aesthetic Computer (github.com)
- Grounding AI in reality with a little help from Data Commons (research.google)
- Common Data Structures in Common Lisp (blog.djhaskin.com)
- The Rapid Decline of the AI Data Commons [pdf] (www.dataprovenance.org)
- The Rapid Decline of the AI Data Commons (www.dataprovenance.org)
- Practical Framework for Applying Ostrom’s Principles to Data Commons Governance (foundation.mozilla.org)
- Consent in Crisis: The Rapid Decline of the AI Data Commons (www.dataprovenance.org)
- Consent in Crisis: The Rapid Decline of the AI Data Commons [pdf] (www.dataprovenance.org)
- Web Data Commons (webdatacommons.org)
- Show HN: Phidata – Building Blocks for Data Engineering (docs.phidata.com)
- Show HN: GA3-exporter. Save your Google Analytics 3 data before it's gone (ga3-exporter.com)
- Training Data for the Price of a Sandwich: Common Crawl's Impact on Gen AI (foundation.mozilla.org)
- Training Data for the Price of a Sandwich: Common Crawl's Impact on Generative (foundation.mozilla.org)
- Knowing When to Ask – Bridging Large Language Models and Data [pdf] (docs.datacommons.org)
- Large language model data pipelines and Common Crawl (blog.christianperone.com)
- Large language model data pipelines and Common Crawl (WARC/WAT/WET) formats (blog.christianperone.com)
- Discovering Shopify Domains: A Journey Through Common Crawl Data (alistechtales.substack.com)