submitted 9 months ago by inspxtr@lemmy.world to c/datahoarder@lemmy.ml

I’m looking for a data archive of corporate ownership networks. For example, Alphabet owns Google, …, ideally with some metadata such as when each company was created or acquired by Alphabet. I was made aware of OpenCorporates, but as far as I can tell it doesn’t have this kind of data.

Apologies in advance if this is not appropriate content for the community. I figured digital archivists may be aware of such an archive. I couldn’t find a Lemmy community specifically for asking about data suggestions. If there’s a community better suited for this post, please let me know.

Thanks!

[-] cmd@discuss.tchncs.de 1 points 9 months ago

I would also be really interested in a dataset like this. Institutional investors in public companies are sometimes listed in a company’s SEC Form 10-K, but I have yet to fully learn the structure of these documents. Someone intelligent could probably write a web crawler to scrape institutional and individual ownership of companies from SEC filings. Ideally, with a proper dataset like this you could map out who owns each company, who owns each of the owning companies, and so on, which would be really interesting information.
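As a rough sketch of the "who owns the owners" idea, assuming you already had the ownership pairs in hand: the `OWNED_BY` mapping below is made-up illustration data (only the Alphabet/Google link comes from this thread; `FundA`, `FundB`, and `HoldingCo` are hypothetical), and `all_owners` is just a name picked for the example.

```python
from collections import deque

# Hypothetical data: each company maps to its direct owners.
# Only "Google is owned by Alphabet" comes from the thread; the rest is invented.
OWNED_BY = {
    "Google": ["Alphabet"],
    "Alphabet": ["FundA", "FundB"],
    "FundA": ["HoldingCo"],
}

def all_owners(owned_by, company):
    """Return every direct and indirect owner of `company`.

    Breadth-first walk up the ownership graph. The `seen` set keeps
    circular ownership (A owns B, B owns A) from looping forever.
    """
    seen = set()
    queue = deque(owned_by.get(company, []))
    while queue:
        owner = queue.popleft()
        if owner in seen or owner == company:
            continue
        seen.add(owner)
        # Queue up the owners of this owner.
        queue.extend(owned_by.get(owner, []))
    return seen
```

Calling `all_owners(OWNED_BY, "Google")` walks from Google up through Alphabet to the funds and the holding company. A real dataset would also want ownership percentages and dates on each edge, which this sketch leaves out.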

[-] cmd@discuss.tchncs.de 1 points 9 months ago

I don’t currently have an adequate understanding of the different SEC forms, and although I can use Python, I have no experience writing web crawlers at the moment.

[-] inspxtr@lemmy.world 1 points 9 months ago

I don’t have any experience with understanding SEC forms either. Is there a repository for SEC forms? Or do you imagine going through every company’s website to mine for those forms?

[-] cmd@discuss.tchncs.de 2 points 9 months ago

The SEC has the EDGAR database, where you can look up any company and access its SEC forms, but you still need to know which forms contain the information. For example, the 10-K of one company listed its top shareholders, but I wasn’t able to find that info in the 10-K of another company (possibly because I didn’t know where to look). I know you can use EDGAR to at least look up these forms, but I do not know the full capabilities of the database (such as whether you can query for ownership directly) because I just discovered it the other day.

[-] cmd@discuss.tchncs.de 1 points 9 months ago

It looks like EDGAR has a REST API and a full-text search as well.
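A minimal sketch of what using that API might look like, not a tested crawler. It assumes the `data.sec.gov/submissions/CIK##########.json` endpoint and its `filings.recent` layout of parallel lists; the function names and the `USER_AGENT` value are placeholders made up for this example, and which form types actually carry ownership data is something you would still need to work out.

```python
import json
import urllib.request

# Placeholder: the SEC asks automated clients to identify themselves,
# so swap in your own name and contact address.
USER_AGENT = "example-research-script contact@example.com"

def submissions_url(cik):
    """Build the submissions URL for a company's CIK, zero-padded to 10 digits."""
    return f"https://data.sec.gov/submissions/CIK{int(cik):010d}.json"

def fetch_submissions(cik):
    """Download and parse the filing index for one company (needs network access)."""
    req = urllib.request.Request(
        submissions_url(cik), headers={"User-Agent": USER_AGENT}
    )
    with urllib.request.urlopen(req, timeout=30) as resp:
        return json.load(resp)

def filings_of_type(submissions, form_types):
    """Pick out recent filings whose form type is in `form_types`.

    Assumes `filings.recent` holds parallel lists (`form`, `filingDate`,
    `accessionNumber`); a missing key simply yields an empty result.
    """
    recent = submissions.get("filings", {}).get("recent", {})
    wanted = set(form_types)
    rows = zip(
        recent.get("form", []),
        recent.get("filingDate", []),
        recent.get("accessionNumber", []),
    )
    return [
        {"form": form, "filingDate": date, "accessionNumber": accession}
        for form, date, accession in rows
        if form in wanted
    ]
```

Something like `filings_of_type(fetch_submissions(cik), ["10-K", "DEF 14A"])` would then list a company's annual reports and proxy statements, given its CIK number from an EDGAR company lookup. You would still have to open each document and find the ownership section yourself.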

this post was submitted on 18 Sep 2023
22 points (92.3% liked)
