Hundreds of LLM Servers Expose Corporate and Health Data
A new report finds that LLM automation tools and vector databases can be rife with sensitive data — and vulnerable to pilfering.
Hundreds of open source large language model (LLM) builder servers and dozens of vector databases are leaking highly sensitive information to the open Web.
As companies rush to integrate AI into their business workflows, they often pay insufficient attention to securing those tools, and the information they entrust to them. In a new report, Legit Security researcher Naphtali Deutsch demonstrated as much by scanning the Web for two kinds of potentially vulnerable open source (OSS) AI services: vector databases, which store data for AI tools, and LLM application builders, specifically the open source program Flowise. The investigation unearthed a bevy of sensitive personal and corporate data, unknowingly exposed by organizations scrambling to get in on the generative AI revolution.
"A lot of programmers see these tools on the Internet, then try to set them up in their environment," Deutsch says, but those same programmers are leaving security considerations behind.
Hundreds of Unpatched Flowise Servers
Flowise is a low-code tool for building all kinds of LLM applications. It's backed by Y Combinator, and sports tens of thousands of stars on GitHub.
Whether it's a customer support bot or a tool for generating and extracting data for downstream programming and other tasks, the applications developers build with Flowise tend to access and manage large quantities of data. It's no wonder, then, that the majority of Flowise servers are password-protected.
A password, however, isn't security enough...
Continue reading this article on our sister site, Dark Reading.