How difficult / expensive / useful would it be to download many millions of pages of PDFs from all the state utility commission websites and FERC's eLibrary into cloud storage and train an LLM on them? It seems like being able to conversationally query this kind of massive domain-specific corpus could be very valuable to climate & clean-energy advocates who can't pay an army of lawyers.
npub1f6a33pfyp67y8llhunlhrf855xm47n3fdqymvxfj7yx78c6vqf4scxpnql (npub1f6a…pnql) npub1l8f53alz59sttpkrg9wyts4cvp6zgpngg58eptp69h6yvtnf3etqr89lph (npub1l8f…9lph)
npub1aftq7fvaa4vcayhq75lqjsv85u4kxggp7uhhkhfypsmekgjrqmwsm0lprp (npub1aft…lprp) npub1fdmn3hdng6e8sx7js2v3j2ek26lcwmc72zs8eddumv5qrfmwdlzsckxrvv (npub1fdm…xrvv) npub1stl29g9csuykausqzjddg864j2hpusa8npyth02kzqqd08rnzn4qz28ldd (npub1stl…8ldd) npub14k5d2q4snxzf4c6c34e5vhq45sd5gddy24nc07md92xs2uqt6c3sg78z3m (npub14k5…8z3m) npub1yt5y5luy6rxlp455cu9ghqryaukt4sn8sysr7m4a489cdnua0v2sacxep7 (npub1yt5…xep7) npub1yknuekwghw7xl7ska3nmhvwpw3wr20ytlwjaf3veeunqauehmq5qg88erd (npub1ykn…8erd)