Vitaly has posted 10 posts at DZone. View Full User Profile

Future in the Past: a Lightweight Java SE Runtime

01.27.2010
| 11971 views |
  • submit to reddit

In this article I share results we achieved after implementing a component deployment model, also known as JRE modularity, for the core of J2SE 5.0 and Java SE 6.  However, this is not a mere “Java gets smaller” message. Given that Project Jigsaw is emerging in JDK 7, I also offer some insights on the challenges that any implementation of modularity for the Java SE core may face, all derived from our practical experience.

“Divide and Conquer” Has Worked Out

We did not pioneer the idea that the monolithic Java SE platform needs to be split into components, it was in the air. We merely found a way of how to do that without breaking Java compatibility and implemented it in a compliant Java SE VM, Excelsior JET, back in 2007.

The goal was to let Java programmers bundle a light version of the Java Runtime with their applications leaving the unused components out so as to reduce the size of the installation package. Easier said than done, but we’ve got it made in a Java spec-compliant manner and called the technology Java Runtime Slim-Down (after Project Jigsaw appeared, it’s finally got to me that we should have to call it “Project Rock breaker” or the like).

It has been proved effective for many Java applications. For GUI applications, in particular, the size of a complete installation package with bundled Java Runtime starts from 5MB. In support of this assertion, I refer you to SWTPaint, a sample program taken from the latest Eclipse SDK.

 

The use of Java Runtime Slim-Down yields the results you won’t get with any other Java deployment tool:

  • the size of the SWTPaint installer is 5.5MB (this is a direct download link);
  • the installed application need not the JRE to run and does not download any components from the Internet (so it won’t disturb your firewall)

Note that the GUI application in question is written in Java and uses Java SE 6. Oh, sorry! I had to mention that download size of Swing applications starts from 8MB. Stirring the flame of Swing vs. SWT was by no means my intention. We prepared installation packages for a few sample applications, both Swing- and SWT-based, and you may download them from this page. If you are still in doubt, try the deployment technique yourself. This flash demo will help you get started.

There are good reasons for end users to love “all-inclusive” installation packages at reduced footprint rates, and it’s where a lightweight Java Runtime is of much help. At a larger scale, however, the lack of the JRE modularity impeded the evolution and adoption of the Java SE platform.

In between a Rock and a Hard Place

A bit of history. Remember JSR-83, a proposal on the “multiarray” package originated by IBM. Its implementation could have had a great impact on the number crunching performance in Java. Nevertheless, it was approved only as a Java Standard Extension, never appeared among the core packages and eventually was withdrawn. In the final ballot, Sun made a noticeable comment: “...The proposal requires at least 82 new classes, and this seems inappropriate for the J2SE core...”.  Though I personally was disappointed with the outcome, the need to damp the Java core inflation down sounded reasonable.

Other JSRs were more lucky and the Java Community Process kept Java SE moving forward over the years. In 2006, I attended the Java Licensee Day event. During the Q&A part of the session devoted to then new Java SE 6, one of the licensees sharply asked: “With each release the JRE gets bigger and bigger. Our customers do not need all those new APIs. When will it stop bloating?” I then found myself thinking I agreed with him but I would rather say “Not all our customers need all those new APIs...”.

The question is how many “useful” APIs have not been approved just not to make the JRE bigger? One may ask also how many “useless” APIs have been approved and did make the JRE bigger? It’s clear that requirements of different projects vary and there is no single answer to these questions, but splitting the JRE into components could resolve these issues gracefully.

Here’s a practical example from our support records. We have customers who previously got stuck with J2SE 1.4.2 simply because the footprint of later Java versions was unacceptable for their deployment requirements. Now, after switching to the component model, they are happy users of Java SE 6.

However, I would not like to discuss the Sun’s policy on modularizing Java SE here. There were many pros and cons to consider, both technical an legal, and I fully realized some of them only when working on the Java Runtime Slim-Down technology.

No shooting (in the foot)

As it often happens, once we had started the design, the scope of work suddenly increased. For the truth to be told, we would not reach the goal by simply splitting the Java SE API into components and enabling the user to drop some of them. The big question was how to make the technology usable and reliable? After all, we did not want to create a thing that does not work just because the programmer removed some components too aggressively. We decided to explore the limits of this approach by interviewing those enthusiasts who pursued us for this matter. The results confirmed our suspicion – programmers are not always aware of what parts of the Java SE API are actually used in their applications. A good illustration would be the following transcript written then:

Client.: I do not use that “Baggage-To-Trim” API and no longer want to carry it with my app. As you are a JVM vendor, make me happy, please.

Support Engineer: We understand you concern. Are you sure you don’t use the “Baggage-To-Trim” API?

Client: Absolutely.

Support Engineer. We kindly ask you to double check it. Please run your app with java –verbose:class and inspect the log.

Client: Oops.. You surprised me! It’s proved to be used. Frankly speaking, I did not write that code where it’s used. Let me think of it.

Needless to say that we also had to think of it. In addition, Java SE components may depend on each other implicitly, via the implementing classes, and  most programmers not familiar with the internals could not play safe when removing the components.

We conducted some R&D and figured out that such a technology should come with tools that help the user not shoot himself in the foot and rules which, just in case, provide the fastest recovery.

Tools and Rules

The final solution included a dependency analyzer and “safety net”. The analyzer takes the application’s classes, infers what Java SE components are likely in use and advises to the user.

Under the covers, it’s not simply checking import dependencies as that would work poorly in terms of precision. For instance, such a simplistic analysis would not have revealed that the SWT-AWT bridge, which is part of the SWT package, is not used by the SWTPaint application mentioned above. As a result, the AWT component would be sucked in and the installation size would increase. That said, analyzer design and testing had engaged us for some time.

Does it guarantee that any deployed application will never miss the removed components? I would not lay money on it. After all, a programmer could detach some components by mistake or an application may load a plug-in that uses the Java SE API more extensively than the application itself. Here the following rule comes into play. All removed components are put into a detached package and the developer has to place it on a Web server at the URL s/he assigned when creating the installation. The Web server is considered a "safety net": should the deployed application attempt to use any of the removed components, the Java Runtime will pull the package down from the server and load the requested Java classes.

On the formal side of things, we run the Java Compatibility Kit (JCK) deployed in this mode with some and all Java SE components detached. Noteworthy is that all the tests pass.

However, it is unlikely that a download of a detached package will occur in practice, provided the developer listened to the word of wisdom from the analyzer. For example, these sample applications have been downloaded over a thousand of times since we published them in May 2007, but there was not a single download of a detached package so far.

The last note is about the splitting. We have managed to carve a kernel part of Java SE, about 4MB, that have to be bundled with any application. We could not get it smaller. We wanted to break down the whole thing into more components, each of a smaller size, but were unable to do that. In general, we could obtain better download size figures if the Java SE API implementation classes would not be so tightly coupled, full of cross-references, strongly connected, melted and fused together.

The truth to be told about Sun Java Kernel

Initially, the Consumer JRE was supposed to include a deployment technology for reducing the download size. But what appeared in Java 6 Update 10 under the name Java Kernel is still far from the solution.

The Java Kernel contains the VM and some core classes like java.lang.*, java.io.*  and meets the needs of HelloWorld of sorts only. Upon application launch, the Java Kernel inevitably starts downloading the remaining packages from a Sun web site and no means are provided to package the required bundles with the application. In essence, the end users "download a downloader" and all you can do with it is specify which missing bundles must be downloaded first.

This short table highlights the key differences between Java Runtime Slim-Down and Java Kernel:

Question of the Day

Java Runtime Slim-Down

Sun Java Kernel

Is it possible to create a custom bundle with only the required components for the application installer?

YES

NO

Can I host the server-side infrastructure on my own web site?            

YES

NO

When running a Java application, the dialog 'Installing additional components' is displayed repeatedly. Is this expected behavior?

NO

 YES

You may find more details in Java Kernel FAQ. Probably, the Java Kernel is just a preliminary step towards a solution that may appear in the future.

Historical Notes

The first mention of the JRE modularity being found in the Annals relates to the times of JDK 1.2(!) The discussions lasted for years, the first implementation in the Sun JRE was planned for Java 1.5, then moved to Java 6. The Java Kernel appeared in Java 6 Update 10 proved to be far from the solution.

In parallel with the Java Kernel, Java Module System (JSR-277) was in the works, which, in particular, could address the JRE modularity problem but the deadlock between Sun and an OSGi lobby had it buried.

Finally, in the end of 2008, Project Jigsaw was announced and “Episode IV. The New Hope” commenced.

Four Challenges for Jigsaw

I’m not with the Java SE core group at... at the company that drives the Java development but I’ve been working with the Java SE core for more than ten years. Below I share my opinion on the main challenges to the adoption of Jigsaw in the future.

World peace. The corporate battle around OSGi needs a break. None has won and the Java Community has lost because, time after time, JRE modularity solutions appear to disappear. To use or not to use OSGi now is less of an issue as compared with the next challenge.

Backward compatibility. In modular approach, each component should be sufficiently isolated and import relations should be declared statically. Besides that, import graph should be (close to) acyclic to minimize the number of indirectly used components.

The problem is that the reference implementation of the Java SE API was coded without having modularity in mind. Somewhat it was a side-effect of the Java’s lazy classloading which created an illusion that a use of any class in Java code costs nothing until it’s actually executed at run time. It’s proved to be a technical debt and now is the time to pay the interest.

In practice, it means that spaghetti-like dependencies between the implementing classes are omnipresent and breaking the ties without breaking backward compatibility with previous Java versions is double tough. The danger is to create something like Apache Harmony: everything is implemented with an elegant internal architecture, samples work, but existing Java apps have issues.

Profiles all over again? Some outlines of Jigsaw mention so called Java SE Profiles, e.g. Profile for headless apps, for basic RIA, for rich desktop apps, etc. A potential threat here is the repetition of the same old story of the monolithic JRE unless user-defined profiles will be allowed. One size does not fit all even if the presets are defined carefully.

Need for total modularization. There is a risk to get no benefits from the JRE modularity alone. Most real world applications use third-party components (Java APIs), which are yet to be modularized. Typically, applications tend to use only a part of the functionality provided by an API. Without total modularization, chances are good that unnecessary parts of (modularized) JRE will be taken in due to use of a (monolithic) third-party component. And it’s no matter whether the import declarations will be written in OSGi bundle manifests or somewhere else.

Conclusion

In closing, I’d like to say that we implemented Java Runtime Slim-Down due to high demand from our clients. I strongly believe, however, that modularized JRE has to come with Java SE out-of-the-box, not just as a vendor solution. I wrote this article with a little hope that our past experience would be of some help for the future development of the Java platform.

P.S. Steve Jobs once said that Java is a big heavyweight ball and chain. Good news: the ball is now optional! You may detach it and use module Chain only.

Published at DZone with permission of its author, Vitaly Mikheev.

(Note: Opinions expressed in this article and its replies are the opinions of their respective authors and not those of DZone, Inc.)

Tags:

Comments

Osvaldo Doederlein replied on Wed, 2010/01/27 - 11:54am

Excellent writeup. Do you have any comment in the effort that Sun has been making recently to reduce dependencies? I track JDK7's/OpenJDK's bugs/changeset comments, and in the last six months at least, there were many changes of that kind - some lazy loading here, some wrapping or other trick there, so when applications load API A it doesn't bring up unrelated API B immediately. I specifically remember a big change to avoid initializing the java.util.logging API, which apparently is used everywhere internally, but in practice seldom needed (loggers not enabled).

I suspect that the changes performed so far are mostly the lower-hangling fruit, or the most critical changes to trim down JRE's base classloading for typical JavaFX apps (FX is driving a lot of JDK improvements recently, which is easy to understand). Jigsaw promises to really untangle the depends in the core, and should have additional, riskier refactorings. But we already have JDK 6u18, which contais virtually all such enhancements performed recently (usually implemented first in JDK7 and then backported). JET 7 is at JDK 6u16, I wonder if you're already working on 6u18 and if so, if you've noticed any significant gains in modularity for JET's packaged/slimmed-down apps.

Wrt the total modularization issue, I expect that once JDK7 is out, providers of important third-party libraries will start preparing Jigsaw modules of their jars, too. I expect this to happen soon at least for professionally maintained projects (commercial libs; free ones from large groups like RedHat/JBoss, Apache etc.). of course, the first versions may be "jigsawed" in a hurry, so you get a single big 10Mb module that depends on the entire JavaSE. May have towait a couple years down the road until everybody identifies and fixes their own dependency spaghetti too.

Jan Kotek replied on Thu, 2010/01/28 - 9:16am

Nice, but 10 years after it was needed :-(

Comment viewing options

Select your preferred way to display the comments and click "Save settings" to activate your changes.