AussieBytes Blog ◖ᵔᴥᵔ◗

AI Resource Hub

A curated, no-fluff AI-related index of live dashboards, model cards, evals, policy, safety, chips/energy, and price signals I use to separate hype from reality. Updated: 18 Aug 2025.


TL;DR


How to use this page


Live leaderboards & evals (what’s hot right now)

How to use: Arena = “vibes + breadth”, MLPerf = “hardware truth”, SWE-bench = “agentic coding realism”, ProphetArena/FutureBench = “can it forecast?”, AA = “how smart + how much?”.


Model release notes & changelogs (source of truth)

Tips: Read release notes before the blog hype; they list deprecations, limits, and pricing changes.


Official model cards & open weights

Tips: sanity-check safety scopes, context limits, modalities, and licence constraints.


Incidents, red-teaming & security

Why it matters: Helps plan safety and alignment protocols and quantify risks


Policy, standards & governance

Use: map internal controls and vendor due-diligence to internal controls.


Compute, chips & energy (follow the supply)

How it helps: chips + grid constraints often explain model availability and API limits better than press releases. Currently, model intelligence is directly tied to increases in compute and chip increased availability and innovation.


GPU price signals (live)

Watch: falling rental prices can pre-signal “capacity relief” and cheaper fine-tunes.


Benchmarks to watch for agents & reasoning

Rule of thumb: prefer evals with transparent task lists, cost accounting, and reproduction kits.


Why here: legal direction shapes training data access, indemnities, and enterprise risk posture.


Sustainability & emissions

Use to back-of-the-envelope the footprint of training/fine-tune plans.


🇦🇺 Australian perspective

Local edge: align deployments to APPs (privacy), safety guidance, and critical infrastructure constraints.


“Follow along” — trustworthy research & market primers


Spotted a must-have resource or a broken link? Ping me at my LinkedIn or X accounts.