How, exactly?
They should sure, who’s coughing up the tens of millions of dollars that might cost?
If they don’t have the resources to do it, they can’t.
Distributed filesystems that self hosters can support may be the future for resilient data, but we’re not really there yet in a scalable way.
Samesies…
It’s very difficult though, sourcing material is difficult enough, archiving and making it actually useful and valuable even moreso. It takes a lot of intelligent processing.
LLMs can reduce that effort a lot, on the searching side, but that’s very expensive either in hardware or in API costs. And either way, would likely involve the efforts of a team to achieve.