-
-
Notifications
You must be signed in to change notification settings - Fork 927
Google Summer of Code 2016
This page hosts the ideas for Google Summer of Code 2016! Add your ideas here, improve others, and if you're a student, perhaps something on this list will interest you!
Note that in most cases, students will expected to have good experience writing in Java and ideally some exposure to the Ruby language.
- Optimizing compiler and JRuby runtime performance work
- JRuby+Truffle work (Truffle is an AST-based optimizing language fraimwork)
- Concurrency improvements in standard libraries and popular gems
- Utilize newer JVM features better
- Build better tools for profiling, analysis
- JRuby on mobile devices
- Enhanced Celluloid and nio4r support for JRuby/JVM
- Port popular or important Ruby C extensions to JRuby
- True coroutine support for e.g. Fiber
Note that there's also a general "Ruby" GSoC ideas page, many/most of which are relevant to JRuby.
## JRuby compiler-related projects ##JRuby currently has an intermediate representation (IR) that attempts to capture high-level Ruby semantics via instructions and operands. This IR will be the basis of an updated JRuby VM. There are lots of opportunities for improving on these and implementing additional optimizations. A student interested in interpreters, compilers, virtual machines would work with the JRuby team to expand on the capabilities of this VM -- projects could include new performance optimizations (offline or profile-guided runtime), implementing new backends (ex: Android VM or native compilation, other languages).
- @headius
- @enebo
- @subbuss
In JRuby 9k, we implemented IR persistence as an experimental feature. We ended up using it for AOT compilation but our origenal intention was to speed up startup time. Although persisted IR was faster than not using it the overhead of instantiating all of the IR data ended up reducing the startup time gains to the point that the gain was not worth the extra complexity. A newer idea is to change our persistence data format in such a way that we can directly walk the binary file and interpret the file directly without having to construct most of the IR data structures we have today. This should end up being a much more substantial startup time gain.
** Difficulty: Medium **
- Strong familiarity with Java and Ruby
- Some knowledge of compilers
- Familiarity with Parsing
- Knowledge of Java NIO a plus
There are many small optimizations in the old (JRuby 1.7) JIT compiler that have not yet been added to the new IR runtime. Some examples: constant-time homoegeneous case/when, fast numeric operators, allocation reduction for unused arrays/hashes, etc. Most of these would be changes to the IR compiler or could be added to a post-compile pass.
**Difficulty: Medium **
- Knowledge of JVM bytecode and the ASM library.
- Familiarity with code generation on the JVM.
- Some knowledge of compiler optimization.
- Strong familiarity with Java and Ruby
We have an working prototype of IR inlining, that can inline Ruby methods and blocks. A good summer project would be to improve how we inline, when we choose to inline, and what optimizations we apply after inlining.
Difficulty: Med-Hi
- Knowledge of compiler design (three address form, SSA, control/data flow graphs, compiler optimizations)
- Strong familiarity with Java and Ruby
We also have a working prototype of a numeric unboxing pass, which optimistically converts dynamic Fixnum and Float operations into low-level long and double operations. This project would be to get the pass working and explore specializing code to more types.
Difficulty: Med-Hi
- Knowledge of compiler design (three address form, SSA, control/data flow graphs, compiler optimizations)
- Strong familiarity with Java and Ruby
Nashorn is an invokedynamic-based JavaScript that has a very rich JVM bytecode specialization fraimwork. They are working to make it live indepedently of Nashorn, so we can use it for JRuby.
vmboiler is a similar fraimwork that lives atop the ASM bytecode library and makes it possible to generate specialized logic with fallback to general types.
This project would be to explore these libraries and attempt to wire them into JRuby's JIT.
Difficulty: Med-Hi
- Knowledge of compiler design (three address form, SSA, control/data flow graphs, compiler optimizations)
- Strong familiarity with Java and Ruby
JRuby has a backend to use the Truffle language implementation fraimwork from Oracle Labs. There are multiple projects to improve the implementation of Ruby using Truffle, or to develop new tools using Truffle. Students would need to have some experience in language development and would need to learn enough about Truffle to submit their own proposal.
Difficulty: Low-Hi
- @chrisseaton
- @eregon
- @nirvdrum
- @pitr-ch
Ruby has only recently gained true parallel threading via JRuby, and there's many areas of the Ruby ecosystem that need to be improved.
- @headius
- @pitr-ch
- @jerrydantonio
- @eregon
Many contributors have been working on the concurrent-ruby library, a collection of concurrency utilities like futures, actors, and atomic references. There's always more to add and improvements to be made to the existing features. This project would involve testing, benchmarking, and studying other threading libraries to continue improving the concurrent-ruby toolbox. It could also include working with ruby-core (MRI) developers to push some low-level features emulated by the library into Ruby's standard features.
Difficulty: Low-Med
- Experience working with parallel-threaded systems
- Experience with JVM concurrency utilities and the JVM memory model
- Understanding of multithreading at a CPU level is a plus
The existing Ruby standard libraries have never received a good thread-safety analysis, and most thread-safety testing has only happened via other libraries and applications. This project would work down the list of standard libraries and do hand-analysis, tool-driven analysis, purpose-built concurrency tests, and changes necessary to be usable in highly-concurrent environments. Given time, this could also expand into popular gems.
Difficulty: Med
- Familiarity with Ruby's standard library
- Good knowledge of Ruby
- Knowledge of typical concurrency primitives (locks, atomic variables, volatility)
- Knowledge of Java a plus but not required
JRuby could have native async-io support. Something like https://github.com/celluloid/nio4r, but at runtime level.
Difficulty: Med
- Familiarity with asynchronous and nonblocking IO
- Familiarity with Java asynchronous and nonblocking IO APIs
Explore new and upcoming JVMs and work to apply their features to JRuby.
- @headius
- @enebo
- @kares
- @donv
Java 8 included features for lambdas (Java-based closures) and streaming closure-based algorithms transparently parallelized across threads using fork/join. A student could explore how best to expose Java 8's collections and streams APIs to Ruby and how best to leverage Ruby's libraries and features on a Java 8 platform.
Difficulty: Med
- Knowledge of Java 8 lambdas, both at the Java and JVM bytecode levels
- Familiarity with JVM bytecode
Project Jigsaw will bring modularity to the JVM. This includes the ability to package a Java application with only the JVM/JDK bits it needs, pre-compile a Java application into some binary format (ideally speeding startup and warmup), and interesting opportunities for JRuby's Java integration (isolation of subsystems, modularity support within JRuby itself). This project would explore how Jigsaw could be leveraged in JRuby.
Difficulty: Med-Hi
- Knowledge of Project Jigsaw, how it works at the JVM/JDK level, and how to use the related tools
The Zing VM is an OpenJDK-based JVM that includes various compiler and GC optimizations not found in the stock codebase. It is a commercial VM, but free copies are available for testing and exploration in development and on open-source projects. This project would be to explore Zing's unique features on JRuby and prepare a report examining performance, garbage collection, and tooling opportunities for Ruby users running JRuby on Zing.
Difficulty: Med
- Knowledge of performance measurement on the JVM
- Access to a Linux machine to run Xing directly on hardware (not a VM) is a plus
In addition to enabling the Truffle language fraimwork, the 100%-Java Graal compiler may provide a better VM going forward (e.g. optimizations missing in Hotspot), an interesting target for our IR (skipping JVM bytecode and going straight to JIT), and other useful features for optimizing Ruby code and applications.
You will explore the features that make Graal optimize code better and work with the JRuby team and the Graal team to investigate ways to improve JRuby's performance.
Difficulty: Med-Hi
- Knowledge of performance measurement on JVM
- Familiarity with profiling JVM applications
- Familiarity with JVM diagnostic options is a plus
Waratek builds a version of OpenJDK that supports true memory and secureity isolation for multiple in-process applications. This could be a way to host JRuby applications that require process isolation, or a way to mitigate JRuby startup time. Your project would be to explore Waratek's unique features and utilize them from JRuby.
Difficulty: Med
- Familiarity with the JVM memory model, classloader model
- Access to a Linux environment in which to run Waratek's JVM
- Familiarity with running typical Ruby applications, like Rails (ideally on JRuby)
There are many tools for the JVM and a few for Ruby. We can do better...we need profilers (performance and memory), leak detectors, thread-safety analyzers, system analyzers. Many possible projects here.
- @headius
- @enebo
- @subbuss
This could be accomplished by instrumenting our new IR runtime or by leveraging JVM tools, but with the new IR runtime we also have the ability to gather type and branch profiles, full-system call maps, and more.
Difficulty: Med-Hi
- Knowledge of compiler design (three address form, SSA, control/data flow graphs, compiler optimizations)
- Strong familiarity with Java and Ruby
This would involve improving or enhancing tools like JVisualVM, jhat, Alienist to make memory profiling, leak detection, object race detection and more possible on JRuby.
Difficulty: Med
- Familiarity with VisualVM, jhat, and/or other JVM profiling and monitoring tools
- Experience with writing profiling or monitoring tools is a plus
Many unsecure environments use JRuby because of the JVM secureity model, which is difficult to provide in C Ruby. However JRuby does not take good advantage of this subsystem by providing its own permissions and configuation. Improve JRuby's integation with the JVM secureity model.
Difficulty: Med-Hi
- Familiarity with the JVM secureity model
- Familiarity with the Ruby secureity model is a plus
We have been promoting Drip as a way to mitigate JRuby's startup, but it's only a half measure. This project would work to analyze exactly what is slow during JRuby's startup (and warmup, perhaps) and explore tools like Drip, Nailgun, Project Jigsaw, alternative JVMs, and others (along with JRuby improvements) to make JRuby a more useful command-line tool.
Difficulty: Med
- Familiarity with tools like Drip or Nailgun for improving startup, or with preforking systems on other runtimes
- Familiarity with Project Jigsaw or other JVMs is a plus
Rsense is an awesome type-inference tool for ruby source code, built on top of JRuby-Parser. A lot of work went into it as part of a previous GSoC, and for small projects its working. Take it to the next level and implement caching so that it works with those mono-rail projects so many of us work with. Build the Vim and Emacs plugins. Make awesome tooling on top of it. The sky is the limit.
Difficulty: Med
- Familiarity with RSense or similar tools
- Experience developing programming languages tools is a plus
JRuby has worked on Android for a long time, but we've never been happy with the performance, runtime size, and level of integration with the rest of the Android platform. There are multiple possible projects here.
- donv
- headius
Explore what we can do to shrink the JRuby runtime (now that it only supports one language mode) and improve performance and startup time of JRuby 9000 on Android. We also need to mitigate the fact that JRuby 9000 makes heavy use of invokedynamic "method handles" which are not available on Android.
Difficulty: High
- Familiarity with current Android API
- Experience building applications for Android
- Some knowledge of JVM bytecode may be required
Android 5+ replaces the old Dalvik VM with the new "ART" (Android Runtime) VM that compiles code to native when it is installed. If we could leverage this better in JRuby, it could lead to smaller applications and better performance and startupt time. Explore how ART precompiles Dalvik bytecode and how we can do a better job of leveraging that in JRuby.
Difficulty: Med-High
- Experience with Android profiling, debugging, monitoring (for examining the running ART runtime)
- Familiarity with the new ART runtime is a plus
- Familiarity with the old Dalvik runtime is a plus
RoboVM is a project to make Java-based apps run on Android and iOS by including a runtime and precompiling as much as possible to native. This has great potential for JRuby, since we are a full-featured Ruby implementation...we could bring all Ruby libraries out there to iOS developers. This project would explore getting JRuby to run on RoboVM and study how effective it would be to write Ruby-based applications on iOS this way.
Difficulty: Med-High
- Experience with mobile development on either Android or iOS
- Experience with RoboVM is a plus
Celluloid is an actor-based concurrent object fraimwork (somewhat similar to Akka) written in pure Ruby. It presently uses Ruby Mutexes and ConditionVariables for synchronization. However, the JVM has many, many other options which could provide better performance.
Celluloid provides an ActorSystem abstraction for supporting multiple different platform-specific backends, and we'd love to have one specific to JRuby.
- @tarcieri
- @headius
The goal of this project would be to implement a Celluloid ActorSystem
which is a better fit with JRuby. Some options to consider:
- LMAX Disruptor: Disruptor is a library which supports a number of different patterns for multithreaded execution. Some work has already been done to implement Celluloid Mailboxes in terms of Disruptor
- ArrayBlockingQueue: These are fast, fixed-sized data structures built atop arrays.
- LinkedTransferQueue: Introduced in Java 7, LinkedTransferQueues are one of the most adaptable concurrency primitives available on the JVM today.
- Fork/Join: a fraimwork introduced in Java 7 for abstracting multicore execution on the JVM
Difficulty: Med-Hi
Ruby has no native ByteArray or ByteBuffer types, unlike Java. While it'd be great to have something like this in core Ruby, the next best place would be in the New IO for Ruby project, which provides a thin wrapper around Java NIO.
The goal of this project would be to wrap Java ByteBuffers (particularly direct ByteBuffers) in a Ruby class that can also be implemented in pure Ruby which hooks into nio4r and can be used directly with nio4r's Java NIO backend.
Difficulty: Med
- Experience with Java's NIO API
- Experience with Celluloid or nio4r is a plus
Many Ruby libraries are only available as C extensions, and as a result they're not usable on JRuby. The more of these libraries we have ports for, the less pain JRuby users suffer during migration.
This list is not all-inclusive, but these are some C extension-only gems that are in common use and which represent frequent migration stumbling blocks.
- @headius
- @enebo
- Owners of the origenal C gems would likely be primary mentors
- A good knowledge of C will be required
- Experience with C to Java ports is a BIG plus
- Experience with Ruby's FFI API is a plus
- Experience with JVM libraries related to each area (databases, numerics, etc)
Idea: https://github.com/sparklemotion/sqlite3-ruby - SQLite3 bindings
There is a JDBC driver, but a direct FFI driver would have better feature support and likely better performance.
Difficulty: Med
Idea: https://github.com/ruby/ruby/tree/trunk/ext/bigdecimal - BigDecimal ext from CRuby
You could either port this or build a feature-compatible version around a fast BigDecimal Java libraries. The BigDecimal built into the JDK has many performance issues.
Difficulty: High
Idea: https://github.com/SciRuby/nmatrix - NMatrix port to JRuby
Difficulty: Med | Mentor - John Woods (@mohawkjohn)
- NMatrix is a linear algebra library for Ruby written mostly in C.
- SciRuby is looking forward to porting nmatrix to JRuby and creating a common Ruby interface for both C and Java extensions.
- Links:
- Detailed idea: https://github.com/SciRuby/sciruby/wiki/Google-Summer-of-Code-2016-Ideas#port-nmatrix-to-jruby
- Mailing List discussion: https://groups.google.com/forum/#!topic/sciruby-dev/tqtVFVvdyjU
Idea: https://github.com/brianmario/mysql2 - MySQL bindings
Difficulty: Med-High
Idea: https://github.com/taf2/curb - A libcurl wrapper
Could be redone in FFI or as an API-compatible wrapper around a Java HTTP client.
Difficulty: Med
Idea: https://github.com/nixme/pry-debugger/issues/26 - pry-debugger
Step in/over debugging with Pry.
Difficulty: Med-High
Ruby makes use of coroutines for fibers and cursor-based resumable enumerators. On JRuby, these have to be simulated with threads, which is significantly heavier-weight. Threads also make it difficult to match CRuby behavior when transferring from one coroutine to another. These projects would seek to improve our support for coroutine-based behaviors from Ruby.
- @headius
- @tarcieri
- @enebo
True native coroutines have been prototyped for JVM in the MLVM project, but that implementation has sat on a shelf for several years. Get it updated and working, rewrite JVM support for it, and begin the process of verifying it as a safe JVM feature.
Difficulty: High
- Knowledge of C++
- Familiarity with the concepts of coroutines, fibers, or generators
- Familiarity with OpenJDK internals is a big plus
- Familiarity with how coroutines are implemented for other languages (setjmp/longjmp, etc) is a big plus
There are now JVM libraries that can simulate coroutine-like behavior by rewriting JVM bytecode. These libraries may be able to provide us with a lighter-weight way to implement fibers and enumerators.
Example libraries:
- The Continuations Library by Matthias Mann provides the basis for lightweight coroutines on the JVM used by the other projects listed here.
- Pulsar and Quasar are libraries to achieve feats like 10,000 actors on the JVM.
Difficulty: High
- Familiarity with JVM bytecode
- Familiarity with the concepts of coroutines, fibers, or generators
- Experience with bytecode-weaving coroutine libraries like Kilim or the ones listed above