Monday, March 31, 2025

New top story on Hacker News: Show HN: GuMCP – Open-source MCP servers, hosted for free

Show HN: GuMCP – Open-source MCP servers, hosted for free
18 by murb | 3 comments on Hacker News.
Hello! We open sourced all our current MCP servers to platforms like Slack, Google sheets, Linear, Perplexity and will be contributing a few more integrations every day to the project. problems we're hoping to solve: - Many people are creating MCP servers for the same apps. They're scattered across different repos but flavors of the same thing. We're making one standardized mono project for all MCP servers. - Startups are charging for hosting MCP servers. This is blocking tons of people from being able to play around with MCP casually. We're hosting them for free. - Non-technical people should be able to use MCP without needing to learn how to clone a repo and set up a venv. We're trying to enable a one click integration if people want to use the free hosted service. The plan is to keep contributing until we have an MCP server for basically every useful app anyone could want.

Sunday, March 30, 2025

Saturday, March 29, 2025

Friday, March 28, 2025

New top story on Hacker News: Show HN: Hexi, modern header-only network binary serialisation for C++ hackers

Show HN: Hexi, modern header-only network binary serialisation for C++ hackers
28 by Chaosvex | 6 comments on Hacker News.
Over the last few years, I've needed an easy way to quickly serialise and deserialise various network protocols safely and efficiently. Most of the libraries that existed at the time were either quite heavy, had less than stellar performance, or were an abstraction level above what I was looking for. I decided to put together my own class to do the job, starting with an easy, low-overhead way to move bytes in and out of arbitrary buffers. Along the way, it picked up useful bits and pieces, such as buffer structures and allocators that made the byte shuffling faster, often being able to do it with zero allocations and zero copies. Safety features came along to make sure that malicious packet data or mistakes in the code wouldn't result in segfaults or vulnerabilities. It's become useful enough to me that I've packaged it up in its own standalone library on the chance that it might be useful to others. It has zero dependencies other than the standard library and has been designed for quick integration into any project within minutes, or seconds with a copy paste of the amalgamated header. It can be used in production code but it's also ideal for for those that want to quickly hack away at binary data with minimal fuss.

Thursday, March 27, 2025

Wednesday, March 26, 2025

Tuesday, March 25, 2025

Monday, March 24, 2025

Sunday, March 23, 2025

Saturday, March 22, 2025

Friday, March 21, 2025

Thursday, March 20, 2025

New top story on Hacker News: Show HN: AgentKit – JavaScript Alternative to OpenAI Agents SDK with Native MCP

Show HN: AgentKit – JavaScript Alternative to OpenAI Agents SDK with Native MCP
35 by tonyhb | 9 comments on Hacker News.
Hi HN! I’m Tony, co-founder of Inngest. I wanted to share AgentKit, our Typescript multi-agent library we’ve been cooking and testing with some early users in prod for months. Although OpenAI’s Agents SDK has been launched since, we think an Agent framework should offer more deterministic and flexible routing, work with multiple model providers, embrace MCP (for rich tooling), and support the unstoppable and growing community of TypeScript AI developers by enabling a smooth transition to production use cases. This is why we are building AgentKit, and we’re really excited about it for a few reasons: Firstly, it’s simple. We embrace KISS principles brought by Anthropic and HuggingFace by allowing you to gradually add autonomy to your AgentKit program using primitives: - Agents: LLM calls that can be combined with prompts, tools, and MCP native support. - Networks: a simple way to get Agents to collaborate with a shared State, including handoff. - State: combines conversation history with a fully typed state machine, used in routing. - Routers: where the autonomy lives, from code-based to LLM-based (ex: ReAct) orchestration The routers are where the magic happens, and allow you to build deterministic, reliable, testable agents. AgentKit routing works as follows: the network calls itself in a loop, inspecting the State to determine which agents to call next using a router. The returned agent runs, then optionally updates state data using its tools. On the next loop, the network inspects state data and conversation history, and determines which new agent to run. This fully typed state machine routing allows you to deterministically build agents using any of the effective agent patterns — which means your code is easy to read, edit, understand, and debug. This also makes handoff incredibly easy: you define when agents should hand off to each other using regular code and state (or by calling an LLM in the router for AI-based routing). This is similar to the OpenAI Agents SDK but easier to manage, plan, and build. Then comes the local development and moving to production capabilities. AgentKit is compatible with Inngest’s tooling, meaning that you can test agents using Inngest’s local DevServer, which provides traces, inputs, outputs, replay, tool, and MCP inputs and outputs, and (soon) a step-over debugger so that you can easily understand and visually see what's happening in the agent loop. In production, you can also optionally combine AgentKit with Inngest for fault-tolerant execution. Each agent’s LLM call is wrapped in a step, and tools can use multiple steps to incorporate things like human-in-the-loop. This gives you native orchestration, observability, and out-of-the-box scale. You will find the documentation as an example of an AgentKit SWE-bench and multiple Coding Agent examples. It’s fully open-source under the Apache 2 license. If you want to get started: - npm: npm i @inngest/agent-kit - GitHub: https://ift.tt/lDUILJt - Docs: https://ift.tt/pfS9Hda We’re excited to finally launch AgentKit; let us know what you think!

BBC Verify: Assessing Israel's renewed ground operation

Israel has sent more troops into Gaza after resuming airstrikes on what it calls "terror targets". The bombing has killed more than 500 people in the last 72 hours, according to the Hamas-run health ministry. BBC Verify's Nick Beake has been looking at where the Israeli troops are now operating and the impact it's having on Palestinians in the territory.

from BBC News https://ift.tt/ot9W8sX
via IFTTT

Wednesday, March 19, 2025

Tuesday, March 18, 2025

Monday, March 17, 2025

Sunday, March 16, 2025

Saturday, March 15, 2025

Friday, March 14, 2025

New top story on Hacker News: Show HN: OCR Benchmark Focusing on Automation

Show HN: OCR Benchmark Focusing on Automation
3 by prats226 | 0 comments on Hacker News.
OCR/Document extraction field has seen lot of action recently with releases like Mixtral OCR, Andrew Ng's agentic document processing etc. Also there are several benchmarks for OCR, however all testing for something slightly different which make good comparison of models very hard. To give an example, some models like mixtral-ocr only try to convert a document to markdown format. You have to use another LLM on top of it to get the final result. Some VLM’s directly give structured information like key fields from documents like invoices, but you have to either add business rules on top of it or use some LLM as a judge kind of system to get sense of which output needs to be manually reviewed or can be taken as correct output. No benchmark attempts to measure the actual rate of automation you can achieve. We have tried to solve this problem with a benchmark that is only applicable for documents/usecases where you are looking for automation and its trying to measure that end to end automation level of different models or systems. We have collected a dataset that represents documents like invoices etc which are applicable in processes where automation is needed vs are more copilot in nature where you would need to chat with document. Also have annotated these documents and published the dataset and repo so it can be extended. Here is writeup: https://ift.tt/3C5HeEQ Dataset: https://ift.tt/sBF0WqL Github: https://ift.tt/vrb13om Looking for suggestions on how this benchmark can be improved further.

Thursday, March 13, 2025

Wednesday, March 12, 2025

Tuesday, March 11, 2025

Monday, March 10, 2025

Sunday, March 9, 2025

Saturday, March 8, 2025

Friday, March 7, 2025

Thursday, March 6, 2025

Wednesday, March 5, 2025

Tuesday, March 4, 2025

New top story on Hacker News: Show HN: Open-source Deep Research across workplace applications

Show HN: Open-source Deep Research across workplace applications
8 by yuhongsun | 1 comments on Hacker News.
I’ve been using deep research on OpenAI and Perplexity and it’s been just amazing at gathering data across a lot of related and chained searches. Just earlier today, I asked “What are some marquee tech companies / hot startups (not including the giants like FAAMG, Samsung, Nvidia etc.)”. It’s a pretty involved question and looking up “marquee tech startups” or "hot tech startups" on Google gave me nothing useful. Deep research on both ChatGPT and Perplexity gave really high quality responses with ChatGPT siding on slightly larger scaleups and Perplexity siding more on up and coming companies. Given how useful AI research agents are across the internet, we decided to build an open-source equivalent for the workplace since a ton of questions at work also cannot be easily resolved with a single search. Onyx supports deep research connected to company applications like Google Drive, Salesforce, Sharepoint, GitHub, Slack, and 30+ others. For example, an engineer may want to know “What’s happening with the verification email failure?” Onyx’s AI agent would first figure out what it needs to answer this question: What is the cause of the failure, what has been done to address it, has this come up before, and what’s the latest status on the issue. The agent would run parallel searches through Confluence, email, Slack, and GitHub to get the answers to these then combine them to build a coherent overview. If the agent finds that there was a technical blocker that will delay the resolution, it will adjust mid-flight and research to get more context on the blocker. Here’s a video demo I recorded: https://www.youtube.com/watch?v=drvC0fWG4hE If you want to get started with the GitHub repo, you can check out our guides at https://docs.onyx.app . Or to play with it without needing to deploy anything, you can go to https://ift.tt/cmxkPvu P.S. There’s a lot of cool technical details behind building a system like this so I’ll continue the conversation in the comments.

Monday, March 3, 2025

Sunday, March 2, 2025

Saturday, March 1, 2025

Popular Posts

Recent Posts

Unordered List

Text Widget

Blog Archive

Search This Blog

Powered by Blogger.