Summary
In a recent blog post, Assaf Arkin compares threads and independent processes, suggesting that most Java developers turn to threads to scale their applications, whereas developers working with PHP, Ruby, or other LAMP languages use processes. He argues that processes scale better.
Assaf Arkin's recent blog post, Why Processes Scale Better Than Threads, contrasts the ways in which LAMP developers and Java developers build complex applications:
In the LAMP world, processes are everything. If you want to pull out data from a file, sort it, and e-mail the result, you pipe several programs together. You’re building a solution by assembling processes.
And for more complex tasks you add even more processes. Want to do things on a schedule? Fire them up with cron. Need to improve throughput? Start up a cache process. Monitor uptime? That’s another process for you.
By contrast, Java developers would run just one JVM process, and call into various APIs to accomplish those same tasks:
In Java you don’t scan files with grep, you use a library. You don’t pipe e-mails to sendmail, you use a library. All the features you need are folded into the VM.
Which turned a snappy VM into a huge behemoth that takes a couple of minutes to boot, as it’s setting up libraries, frameworks and containers. You don’t want to startup the JVM more than once.
To accomplish multiple concurrent tasks, Java developers would use threads, not independent processes. Arkin believes that these approaches result in different scalability characteristics of an application:
Multi-threaded developers tend to scale through objects, libraries and frameworks. When you focus on the components around you, you don’t pay much attention to anything outside the sandbox. The level of abstraction is the API.
Multi-process developers scale by assembling programs together, chaining them or running them in parallel. If it’s not in the framework, you look for a program (or combination of) that does what you need. The level of abstraction is the task...
The more independent processes you have, the easier they are to combine into new and interesting uses.
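The thread-based style Arkin describes can be sketched in a few lines. The following is a minimal, illustrative example (the class and method names are this summary's own, not from the article): several concurrent tasks submitted to a thread pool inside a single JVM, sharing memory rather than running as separate processes.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.Future;

public class ThreadedTasks {
    // The single-JVM style: several concurrent tasks run as threads in one
    // process, sharing memory, instead of as separate OS processes.
    public static int sumOfSquares(List<Integer> inputs) throws Exception {
        ExecutorService pool = Executors.newFixedThreadPool(4);
        try {
            List<Future<Integer>> results = new ArrayList<>();
            for (int n : inputs) {
                final int value = n;
                results.add(pool.submit(() -> value * value)); // each task runs on a pool thread
            }
            int total = 0;
            for (Future<Integer> f : results) total += f.get(); // gather results in-process
            return total;
        } finally {
            pool.shutdown();
        }
    }

    public static void main(String[] args) throws Exception {
        System.out.println(sumOfSquares(List.of(1, 2, 3, 4))); // prints 30
    }
}
```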
Because processes can easily be distributed across multiple servers, Arkin believes that solutions that center around the multi-process approach scale better horizontally (incorporating more servers), whereas the multithreading solution scales better vertically, and is able to take better advantage of a more powerful server.
Arkin's concluding point is that horizontal scaling—distributing workload across many less powerful servers—can result in more overall scale than distributing load to more threads in a single process on one powerful server. Potentially, the horizontal scaling approach is also more economical.
Do you agree with Arkin's conclusion that the multi-process approach scales better? And, if so, how do you architect Java applications to distribute workload among multiple processes?
It's certainly not black and white. I think the author has some good points, and I believe he knows a number of use cases in which doing it any other way (than as he suggested) would be painful overkill.
> To accomplish multiple concurrent tasks, Java developers would use threads, not independent processes. Arkin believes that these approaches result in different scalability characteristics of an application:
>
> > Multi-threaded developers tend to scale through objects, libraries and frameworks. When you focus on the components around you, you don’t pay much attention to anything outside the sandbox. The level of abstraction is the API.
An application server nowadays plays the same role as an O/S: it manages processes.
Essentially all the effort that has gone into application servers simply repeats what has been done in previous decades regarding operating systems.
> > Multi-process developers scale by assembling programs together, chaining them or running them in parallel. If it’s not in the framework, you look for a program (or combination of) that does what you need. The level of abstraction is the task...
But there is no typed interface between tasks, and that is a great problem. Using tasks is like using a dynamically typed programming language: you never know whether it is going to work until you execute it.
> > The more independent processes you have, the easier they are to combine into new and interesting uses.
Low coupling provides better reuse; that is common sense. There was a discussion a few moons ago here on Artima about whether frameworks are better than libraries. The outcome was that libraries are better due to lower coupling.
> Arkin's concluding point is that horizontal scaling—distributing workload across many less powerful servers—can result in more overall scale than distributing load to more threads in a single process on one powerful server. Potentially, the horizontal scaling approach is also more economical.
Since processes can be distributed better than threads, it goes without saying that processes scale better.
But what if threads could be distributed as well? That would certainly turn the case in favor of threads.
> But there is no typed interface between tasks, and that is a great problem. Using tasks is like using a dynamically typed programming language: you never know what is going to work, until you execute it.
Is it a great problem in your actual practical experience?
'cause in mine, it isn't. I have had very occasional problems with tasks created from pipes etc. breaking down with OS and software updates because there wasn't a defined API to work with. This is a very, very small part of my day-to-day issues, though, and I have much greater problems with APIs - including APIs with type checking.
> > But there is no typed interface between tasks, and that
>
> Is it a great problem in your actual practical experience?
More an irritation; the most frequent problem is variations in separators (e.g. CSV with , or ;, quoting differences). Another is one process writing values (dates or decimals) in a national-language form while the next process expects a different form.
As for processes vs threads, I think many applications will need to use both and the best mix is likely to depend on the operating system(s) involved.
> By contrast, Java developers would run just one JVM process, and call into various APIs to accomplish those same tasks:
False Dichotomy. There's nothing stopping a Java developer from scaling with multiple processes that are multi-threaded. I just wrote such an application about a month ago.
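A minimal sketch of that hybrid shape (names are illustrative, not from any real application; the "worker" here is just the current JVM's own binary printing its version, to keep the example self-contained): a Java parent launches several child processes, each of which could itself be multi-threaded.

```java
import java.io.File;
import java.util.ArrayList;
import java.util.List;

public class MultiProcess {
    // Launch several worker processes from one Java parent -- each child is
    // itself free to be multi-threaded. The worker command is the JVM's own
    // binary running "-version", so no external tools are assumed.
    public static int launchWorkers(int count) throws Exception {
        String javaBin = System.getProperty("java.home")
                + File.separator + "bin" + File.separator + "java";
        List<Process> workers = new ArrayList<>();
        for (int i = 0; i < count; i++) {
            workers.add(new ProcessBuilder(javaBin, "-version")
                    .inheritIO()                      // share the parent's stdout/stderr
                    .start());
        }
        int failures = 0;
        for (Process p : workers) {
            if (p.waitFor() != 0) failures++;         // wait for each child to exit
        }
        return failures;
    }

    public static void main(String[] args) throws Exception {
        System.out.println("failed workers: " + launchWorkers(3));
    }
}
```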
> > In Java you don’t scan files with grep, you use a library. You don’t pipe e-mails to sendmail, you use a library. All the features you need are folded into the VM.
This isn't true.
> > Which turned a snappy VM into a huge behemoth that takes a couple of minutes to boot, as it’s setting up libraries, frameworks and containers. You don’t want to startup the JVM more than once.
I write Java apps all the time that are as fast as any other program. This is just nonsense.
> > Multi-threaded developers tend to scale through objects, libraries and frameworks. When you focus on the components around you, you don’t pay much attention to anything outside the sandbox. The level of abstraction is the API.
>
> > Multi-process developers scale by assembling programs together, chaining them or running them in parallel. If it’s not in the framework, you look for a program (or combination of) that does what you need. The level of abstraction is the task...
He's ignoring the problems of resource sharing and synchronization. These problems are simple to solve in a single multi-threaded app in Java but require IO in multi-process architectures.
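A minimal illustration of the in-process case (names are illustrative): threads in one process share memory directly, so a shared counter needs only an atomic variable, with no files, sockets, or other IPC.

```java
import java.util.concurrent.atomic.AtomicLong;

public class SharedCounter {
    // Threads in one process share memory directly: coordinating on a shared
    // counter takes a single atomic variable, no inter-process IO at all.
    public static long countTo(int threads, int perThread) throws InterruptedException {
        AtomicLong counter = new AtomicLong();
        Thread[] workers = new Thread[threads];
        for (int i = 0; i < threads; i++) {
            workers[i] = new Thread(() -> {
                for (int j = 0; j < perThread; j++) counter.incrementAndGet();
            });
            workers[i].start();
        }
        for (Thread t : workers) t.join();   // wait for every worker thread
        return counter.get();
    }

    public static void main(String[] args) throws InterruptedException {
        System.out.println(countTo(8, 100_000)); // prints 800000
    }
}
```

The equivalent coordination between separate processes would need shared memory, a file, a socket, or a message broker.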
> > The more independent processes you have, the easier they are to combine into new and interesting uses.
Sorry, but I think this is total BS. Compared to well-designed OO (of which there is admittedly little), processes are monolithic. How do I reuse just a portion of a process' logic?
> Because processes can easily be distributed across multiple servers, Arkin believes that solutions that center around the multi-process approach scale better horizontally (incorporating more servers), whereas the multithreading solution scales better vertically, and is able to take better advantage of a more powerful server.
Java can do both, and there are good reasons to do so. For example, a Java application can run 5 threads on 5 machines. A comparable multi-process architecture would have 5 processes on 5 machines. The Java process needs only 5 caches. The multi-process architecture requires 25, if it caches at all. In addition, the Java architecture can actually have a smaller footprint on each machine, depending on a number of factors.
> Do you agree with Arkin's conclusion that the multi-process approach scales better? And, if so, how do you architect Java applications to distribute workload among multiple processes?
Sure, in a lot of cases it's much better. If you want high availability, you need the multi-process model. One of the easiest ways to do this is to use a messaging architecture such as what any JMS provider sells.
For optimum scalability you want both. I am using an ETL tool that works with the dataflow paradigm. The tool breaks every step of the flow into a separate process and then spawns several threads for each process. I would call the pipelining approach depth parallelism and the superscalar approach breadth parallelism. The former is easier to use and improves throughput, but often makes latency worse. The latter requires that the data for each thread be independent of the others, and that is often tricky. Witness also this paper http://www.e.u-tokyo.ac.jp/cirje/research/dp/2006/2006cf397.pdf about Japanese industry moving some processes from conveyor-belt to work-cell assemblies.
Since I went through the fields of electronic engineering, computer science and business management already I will stop now :-).
> False Dichotomy. There's nothing stopping a Java developer from scaling with multiple processes
But it's not typical.
> He's ignoring the problems of resource sharing and synchronization. These problems are simple to solve in a single multi-threaded app in Java but require IO in multi-process architectures.
These problems are hard - in Java or any other language. There's a reason Erlang has processes (separate memory spaces communicating via queues) rather than threads. It's not at all hard to argue that Erlang's model is vastly superior to Java's for concurrent processing.
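That share-nothing style can be approximated in Java itself. Here is a rough sketch (names are illustrative, and this is an analogy, not how Erlang is implemented): a worker thread owns its state and communicates only through queues, never through shared mutable data.

```java
import java.util.concurrent.ArrayBlockingQueue;
import java.util.concurrent.BlockingQueue;

public class MessagePassing {
    // Share-nothing concurrency in the Erlang style, approximated with Java
    // threads: the worker owns its running total and talks to the outside
    // world only via queues. Messages are assumed nonnegative; -1 means stop.
    public static int sumViaQueues(int... values) throws InterruptedException {
        BlockingQueue<Integer> inbox = new ArrayBlockingQueue<>(64);
        BlockingQueue<Integer> outbox = new ArrayBlockingQueue<>(1);
        Thread worker = new Thread(() -> {
            try {
                int total = 0;                     // state private to this worker
                int msg;
                while ((msg = inbox.take()) >= 0) total += msg;
                outbox.put(total);                 // reply with the final sum
            } catch (InterruptedException e) {
                Thread.currentThread().interrupt();
            }
        });
        worker.start();
        for (int v : values) inbox.put(v);         // send messages
        inbox.put(-1);                             // send the stop signal
        return outbox.take();                      // receive the reply
    }

    public static void main(String[] args) throws InterruptedException {
        System.out.println(sumViaQueues(1, 2, 3)); // prints 6
    }
}
```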
> processes are monolithic. How do I reuse just a portion of a process' logic?
Your process is too big, then. Unix is the counterexample.
> > For example, a Java application can run 5 threads on 5 machines. A comparable multi-process architecture would have 5 processes on 5 machines. The Java process needs only 5 caches.
With much more complex cache coordination logic...
> > The multi-process architecture requires 25 if it caches at all.
Bogus - caching should be at the service interface level.
> > If you want high availability, you need the multi-process model.
Yep - it's generally faster overall - thread context switches aren't free, after all. Process context switches you're going to get anyhow.
> > False Dichotomy. There's nothing stopping a Java developer from scaling with multiple processes
>
> But it's not typical.
That still doesn't explain how Java having the ability to do both makes it inferior.
> > He's ignoring the problems of resource sharing and synchronization. These problems are simple to solve in a single multi-threaded app in Java but require IO in multi-process architectures.
>
> These problems are hard - in Java or any other language. There's a reason Erlang has processes (separate memory spaces communicating via queues) rather than threads. It's not at all hard to argue that Erlang's model is vastly superior to Java's for concurrent processing.
In terms of what? Performance? Usability? In what way?
> Here's a timely article on the topic that was on digg today.
> http://www.computer.org/portal/site/computer/menuitem.5d61c1d591162e4b0ef1bd108bcd45f3/index.jsp?&pName=computer_level1_article&TheCat=1005&path=computer/homepage/0506&file=cover.xml&xsl=article.xsl
>
> > processes are monolithic. How do I reuse just a portion of a process' logic?
>
> Your process is too big, then. Unix is the counterexample.
There are Unix services that I would like to use pieces of. I understand the concept. It's just that well-designed classes are more reusable than processes.
> > For example, a Java application can run 5 threads on 5 machines. A comparable multi-process architecture would have 5 processes on 5 machines. The Java process needs only 5 caches.
>
> With much more complex cache coordination logic...
What's complex about it? I don't even have to think about it when I write my application.
> > The multi-process architecture requires 25 if it caches at all.
>
> Bogus - caching should be at the service interface level.
OK. I agree that it could be, but is it? Suppose there is a local file used as a resource. In order to avoid loading it into memory for every process, you need to use a shared memory model. Are you suggesting this is easier than using threads in Java?
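To make the in-process case concrete, here is a minimal sketch (names are illustrative, and the "load" is simulated by a counter rather than real file IO): with threads, `ConcurrentHashMap.computeIfAbsent` guarantees the expensive load runs exactly once per key, and every thread shares the result.

```java
import java.util.Map;
import java.util.concurrent.ConcurrentHashMap;
import java.util.concurrent.atomic.AtomicInteger;

public class SharedCache {
    // In one multi-threaded process, an expensive resource is loaded once and
    // shared by every thread; N separate processes would each load their own
    // copy, or need shared memory / an external cache process.
    static final AtomicInteger loads = new AtomicInteger();
    static final Map<String, String> cache = new ConcurrentHashMap<>();

    static String get(String key) {
        // computeIfAbsent is atomic: the loader runs at most once per key,
        // even when many threads ask for the same key concurrently.
        return cache.computeIfAbsent(key, k -> {
            loads.incrementAndGet();          // stands in for an expensive file load
            return "contents-of-" + k;
        });
    }

    public static void main(String[] args) throws InterruptedException {
        Thread[] readers = new Thread[5];
        for (int i = 0; i < readers.length; i++) {
            readers[i] = new Thread(() -> get("config.xml"));
            readers[i].start();
        }
        for (Thread t : readers) t.join();
        System.out.println("loads: " + loads.get()); // prints loads: 1, not 5
    }
}
```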
> > If you want high availability, you need the multi-process model.
>
> Yep - it's generally faster overall - thread context switches aren't free, after all.
I assume you mean on a single CPU machine.
> Process context switches you're going to get anyhow.
> Here's a timely article on the topic that was on digg today.
> http://www.computer.org/portal/site/computer/menuitem.5d61c1d591162e4b0ef1bd108bcd45f3/index.jsp?&pName=computer_level1_article&TheCat=1005&path=computer/homepage/0506&file=cover.xml&xsl=article.xsl
His basic argument here is that threading is non-deterministic and therefore too hard.
I love this part:
To offer another analogy, a folk definition of insanity is to do the same thing over and over again and expect the results to be different. By this definition, we in fact require that programmers of multithreaded systems be insane. Were they sane, they could not understand their programs.
Someone needs to invite Mr. Lee into the real world. First of all, the folk definition is nonsense. There's a whole branch of physics called quantum mechanics that's based in part on this exact expectation. It also implies an extremely ignorant idea that one can expect the same results based on what has been seen before. More to the point, very few real-world applications (i.e. anything that deals with IO) are fully deterministic, whether threaded or not. Lee's use of blatant rhetoric here puts him on very shaky ground with me. Such arguments are usually used by those with preconceived notions that they wish to rationalize.
The other thing is that the assumption that running parallel processes solves these problems in itself is also wrong. You still need to deal with communication between processes and resource sharing.
Perhaps if you think that threads are too difficult for you or the developers you work with, then maybe they are not for you. I don't find them to be all that difficult. Programming with multiple threads is not fundamentally different from writing multi-process code, and threads provide many more options.
> > If you want high availability, you need the multi-process model.
>
> Yep - it's generally faster overall - thread context switches aren't free, after all.
When I said this, I meant that you need more than one machine. Really you need more than one machine at more than one geographic location. I'm not sure if that was clear.
> Process context switches you're going to get anyhow.
I think I misinterpreted this before. You mean that the machine the process is running on will have context switches regardless of how many instances of your app are running. Assuming the CPUs are fewer than the processes (and maybe even otherwise), this is granted. But in an IO-bound application, multi-process systems should generally have more process switches than an equivalent multi-threaded application.
I think a better title for the article would be "Why LAMP is better than Java".
It seems the author is mostly talking about piping data sequentially through a set of standard Linux commands vs. the use of Java threads. This certainly is not the scenario I think of when comparing multi-process programming with multi-threading programming.
> Is it a great problem in your actual practical experience?
>
> 'cause in mine, it isn't. I have had very occasional problems with tasks created from pipes etc breaking down with OS and software updates because there wasn't a defined API to work with. This is a very very small part of my day to day issues, though, and a much greater problem with APIs - including APIs with type checking.
>
> Eivind.
My experience is different; in many occasions, errors were silently ignored and only revealed much later.
> > But there is no typed interface between tasks, and that is a great problem. Using tasks is like using a dynamically typed programming language: you never know what is going to work, until you execute it.
>
> Is it a great problem in your actual practical experience?
>
> 'cause in mine, it isn't. I have had very occasional problems with tasks created from pipes etc breaking down with OS and software updates because there wasn't a defined API to work with.
The biggest issue I have with pipes is when something besides the last process in the pipe has a problem. You can have a very hard time trying to find out why a process is not getting any output from a pipeline, especially when it is okay for there to be no output.
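One mitigation, sketched here in Java rather than shell (names are illustrative; assumes Java 9+ for `ProcessBuilder.startPipeline` and a POSIX system with `sh` and `cat` on the PATH): check the exit status of every stage of the pipeline, not just the last one.

```java
import java.util.ArrayList;
import java.util.List;

public class PipelineCheck {
    // Inspect the exit status of every stage of a pipeline, not just the last.
    // As described above, an upstream stage can die while the final stage
    // still exits 0 with empty output, hiding the failure.
    public static List<Integer> stageExitCodes() throws Exception {
        List<Process> stages = ProcessBuilder.startPipeline(List.of(
                new ProcessBuilder("sh", "-c", "exit 3"), // upstream stage fails silently
                new ProcessBuilder("cat")));              // downstream sees EOF, exits 0
        List<Integer> codes = new ArrayList<>();
        for (Process p : stages) codes.add(p.waitFor()); // wait for and record each stage
        return codes;
    }

    public static void main(String[] args) throws Exception {
        // Only checking both stages reveals that the pipeline actually failed.
        System.out.println(stageExitCodes()); // prints [3, 0]
    }
}
```

This is the in-program analogue of the shell's `set -o pipefail` / `PIPESTATUS` idiom.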
Another issue is that I've noticed more recently that
ulimit -c 0
has suddenly become popular, so you won't even see the telltale sign of a core file for a really poorly behaving process.
Many people can write reliable software. It's the ones that can't, or don't, that I worry about using pipes and processes.