
rune's Introduction

Rune

Rust UNder Emacs

Docs

This project is an experimental Emacs core written in Rust. The project is still at a very early phase but has the following goals:

  • Bring multi-threaded elisp to Emacs
  • Be “bug-compatible” with existing Emacs Lisp packages (everything should still work)
  • Enable performance improvements (including faster GC, regex, and JSON) by leveraging the Rust ecosystem.

See the design doc for more details.

Status

The current goal of this project is to create an editor MVP. We have a basic elisp runtime, and we are working on adding basic editing functionality in a minimal GUI. This will include:

  • buffer
  • text insertion/deletion
  • cursor
  • line wrapping
  • scrolling
  • file IO
  • display tables

If you want to contribute or have ideas for things to add, please open an issue.

lisp

Lisp files are currently pulled from

https://github.com/emacs-mirror/emacs/tree/emacs-29.1/lisp

Any modifications made for bootstrapping carry the tag RUNE-BOOTSTRAP.

Running

The easiest way to run the interpreter is with cargo run --profile=release. The arguments behave as follows:

  • -- --load: load the bootstrapped elisp, then exit
  • -- --repl: open an elisp repl
  • -- --load --repl: load the bootstrapped elisp, then open the repl
  • no arguments: equivalent to --load

MIRI

Run the test suite with MIRI

MIRIFLAGS='-Zmiri-strict-provenance' cargo +nightly miri test

Exploring this repo

The project is defined by a main package rune, which depends on the crates included in the crates directory. One of those is the rune-macros crate, which defines the defun proc macro for defining builtin functions. The rest of the code is contained in src/. The modules are described below.

objects
The basic objects used in the interpreter. These are modeled after Emacs objects using tagged pointers with inline fixnums. Conversion between different primitives and object types is also found here.
reader
The Emacs Lisp reader that translates a string to a cons cell. Due to the simple nature of lisp syntax, the reader is hand-rolled and does not rely on any parsing libraries.
env
The global obarray. Currently, function bindings are global and immutable and value bindings are thread-local and mutable. When the ability is added to share data between threads, this will enable new threads to safely run functions without the need to copy them.
gc
Contains the allocator and garbage collector. All code for rooting and managing objects lives here as well.
bytecode
The bytecode VM. This uses the same opcodes as Emacs and relies on bytecomp.el for compilation.
interpreter
The basic elisp interpreter. This is used only to bootstrap the elisp byte-compiler.
fns, data, alloc
These modules contain definitions of built-in functions. Some are just stubbed out until the functionality is actually needed.

Contributing

See the architecture doc for more info on the structure of Rust Emacs internals.

This project is moved forward by trying to load new elisp files and seeing what breaks. The best way to do that is with cargo run, which will load the currently bootstrapped files. The bootstrapped files are located in main.rs as part of the load function.

Usually what is needed is to implement more primitive functions. This is done with the defun macro. For example, if we wanted to implement the substring function, we would first look at the lisp signature.

(substring STRING &optional FROM TO)

Then we would translate the types to their Rust equivalent. If the correct type is not known we can use Object. In this example we would write our Rust signature as follows:

#[defun]
fn substring(string: &str, from: Option<i64>, to: Option<i64>) -> String {...}
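As a rough illustration, a hypothetical body might look like this (the real implementation's bounds handling and error semantics may differ):

    // Hypothetical implementation: resolve optional (possibly negative)
    // bounds the way elisp's substring does, then slice by characters.
    #[defun]
    fn substring(string: &str, from: Option<i64>, to: Option<i64>) -> String {
        let len = string.chars().count() as i64;
        // Negative indices count back from the end, as in elisp.
        let resolve = |idx: Option<i64>, default: i64| match idx {
            Some(i) if i < 0 => len + i,
            Some(i) => i,
            None => default,
        };
        let from = resolve(from, 0).clamp(0, len) as usize;
        let to = resolve(to, len).clamp(0, len) as usize;
        string.chars().skip(from).take(to.saturating_sub(from)).collect()
    }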

If you run with cargo run -- --load --repl that will load the current bootstrapped files and then open the REPL. From there you can run (load "/path/to/elisp/file.el") to try loading a new file. Files that are not bootstrapped are not yet included in this repo, but are part of Emacs. Once the file is bootstrapped it can be added to the lisp directory.

Blog posts

tagged pointers in Rust
My initial approach to creating tagged pointers in Rust. It serves as an intro to this project.
implementing a safe garbage collector
An overview of the garbage collector used in this project and how Rust enables safe GC abstractions.
Design of Emacs in Rust
Some of the unique benefits that Rust could bring to Emacs.

Further exploration

Remacs
The original Rust and Emacs project. Remacs took the approach of enabling interop between Emacs C core and Rust, enabling them to replace parts of Emacs piecemeal. The project is currently unmaintained but is a big inspiration for Rune.
emacs-ng
The spiritual successor to remacs. This project integrates the Deno runtime into emacs, allowing you to write extensions in elisp or javascript. Which sounds cool if you happen to be a web developer. It really shows the power of integrating Emacs with a more modern ecosystem (which is part of the promise of Rust).
helix
A fast modern text editor written in Rust.
crafting interpreters
This was a big inspiration for this project, and it’s probably one of the best introductions to programming language implementations.

rune's People

Contributors

celeritascelery, dependabot[bot], ki11errabbit, qkessler, rdaum


rune's Issues

Stabilization: Use LazyLock API instead of helper methods

What should be done?

After this PR (#33), we use the OnceLock API to create static variables shared by the program. OnceLock is the less-ergonomic brother of the cooler LazyLock, which is not yet stable (issue tracker).

Using OnceLock, we create helper methods (see buffers(), features() or interned_symbols() on the PR):

pub(crate) fn features() -> &'static Mutex<HashSet<Symbol<'static>>> {
    FEATURES.get_or_init(|| Mutex::new(HashSet::default()))
}

Once LazyLock stabilizes, we can migrate the code from OnceLock to LazyLock, and benefit from some code removals: we'll remove the helper methods created to access them (using get_or_init). The code above would turn into something like:

/// Rust translation of the `features` variable: A list of symbols are the features
/// of the executing Emacs. Used by [`featurep`] and [`require`], altered by [`provide`].
pub(crate) static FEATURES: LazyLock<Mutex<HashSet<Symbol<'static>>>> =
    LazyLock::new(|| Mutex::new(HashSet::default()));

Why should this task be completed?

It removes the need for helper functions, which saves a handful of lines of code. Functionally, OnceLock paired with the helper functions works the same way and gives the same guarantees.

Acceptance Criteria

  • You have verified the code can run on the beta channel: rustup default beta
  • You have made sure the stabilization issue is closed and merged: issue tracker.
  • You have tested the integration works with cargo build --release, cargo t, cargo clippy.

Who are the POCs or Stakeholders?

stack overflow while compiling on Windows

I just came to know about this great project. When I tried to compile and run it on my machine, it caused a stack overflow.

Loading emacs-lisp/debug-early...
Loading emacs-lisp/debug-early Done
Loading emacs-lisp/byte-run...
Loading emacs-lisp/byte-run Done
Loading emacs-lisp/backquote...
Loading emacs-lisp/backquote Done
Loading subr...
Loading subr Done
Loading keymap...
Loading keymap Done
Loading version...
Loading version Done
Loading widget...
Loading widget Done
Loading custom...
Loading custom Done
Loading emacs-lisp/map-ynp...
Loading emacs-lisp/map-ynp Done
Loading env...
Loading env Done
Loading format...
Loading format Done
Loading window...
Loading window Done
Loading files...
Loading pcase...
Loading macroexp...
Loading macroexp Done
Loading pcase Done
Loading easy-mmode...

thread 'main' has overflowed its stack
error: process didn't exit successfully: `target\debug\rune.exe` (exit code: 0xc00000fd, STATUS_STACK_OVERFLOW)

HashMap data structure

In order to map between the Rust and Lisp worlds, we have to handle aliasing. Data structures must be both mutable and aliasable. The "simple" solution to this is to use a RefCell. However we also need to support the maphash function, which iterates over a hashmap, applying a closure to each entry.
We need to iterate through a hashmap while also mutating it. Current iterators don't allow this: they expect to borrow the hashmap for the duration of the iteration, meaning we can't borrow it mutably, even with RefCell.

1 - current solution only gets us halfway

The current solution is only a stopgap to allow bootstrapping. It lets us mutate individual values, but we can't add or remove elements from the hashmap. This is done by making the value of a hashmap a Cell.

    pub(crate) type HashTableView<'ob, T> = HashMap<GcObj<'ob>, T>;
    pub(crate) struct LispHashTable {
        gc: GcMark,
        is_const: bool,
        inner: RefCell<HashTableView<'static, ObjCell>>,
    }

2 - create a hashmap that is backed by indexes, then use the index for iteration

Instead of using the hashmap iterator, if we had a hashmap that was indexable, we could use the index for iteration instead of holding a reference. Not sure if a crate exists, or if we would need to implement our own hashmap.
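If we went with the index-based approach, the iteration could look something like this sketch (assuming the indexmap crate; key and value types simplified):

    use indexmap::IndexMap;

    // Iterate by position, re-borrowing the table on every step, so no
    // iterator borrow is held across calls into code that may mutate it.
    fn maphash(table: &mut IndexMap<String, i64>, mut func: impl FnMut(&str, &mut i64)) {
        let mut i = 0;
        while i < table.len() {
            if let Some((key, value)) = table.get_index_mut(i) {
                let key = key.clone(); // detach the key borrow before calling out
                func(&key, value);
            }
            i += 1;
        }
    }

    fn main() {
        let mut table = IndexMap::new();
        table.insert("a".to_string(), 1);
        table.insert("b".to_string(), 2);
        maphash(&mut table, |_k, v| *v *= 10);
        assert_eq!(table["a"], 10);
    }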

Arena Bind and constrain lifetime

This is similar to #2. We have bind function for Arena.

rune/src/arena/mod.rs

Lines 346 to 351 in 7136b74

pub(crate) fn bind<T, U>(&'ob self, obj: T) -> U
where
    T: ConstrainLifetime<'ob, U>,
{
    obj.constrain_lifetime(self)
}

This relies on a safe trait ConstrainLifetime, which lets us take an object and "bind" its lifetime to the Arena. We use this in cases where an object may have a root lifetime.

Why this is sound

It may seem odd that ConstrainLifetime is a safe trait, but you can't implement it in a way that would be unsound without unsafe code, so calling the trait is safe. All implementations need unsafe code, and by writing that unsafe code you promise to uphold the invariant of ConstrainLifetime: the implementer must be a GC-managed type.

impl<'old, 'new> ConstrainLifetime<'new, Object<'new>> for Object<'old> {
    fn constrain_lifetime<const C: bool>(self, _cx: &'new Block<C>) -> Object<'new> {
        // Lifetime is bound to borrow of Block, so it is safe to extend
        unsafe { transmute::<Object<'old>, Object<'new>>(self) }
    }
}

It is safe to call bind because, so long as we are borrowing from the Arena, garbage_collect can't be called, meaning the object cannot be freed even if the root it came from goes out of scope.

Be mindful of provenance in `LCellOwner::rw2`

(First, very cool project, I hope I'll be able to contribute in the future!)

Hi, I was looking at LCellOwner after your recent blog post and noticed the comparison of the LCells' pointers in rw2 to avoid aliasing mutable borrows, and I was wondering if you had taken pointer provenance into consideration?

I wouldn't do a good job at explaining the concept so I'll refer you to some articles (series, really) from very smart people and the nightly docs:

But in short, pointers shouldn't be cast to integers directly; .addr should be used instead (or the stable polyfill).
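For reference, the provenance-preserving comparison looks like this (pointer::addr is the strict-provenance API, stabilized in recent Rust; the sptr crate served as the polyfill before that):

    // Compare two pointers by address without an `as usize` cast, which
    // would strip provenance information from the compiler's analysis.
    fn same_address<T>(a: *const T, b: *const T) -> bool {
        a.addr() == b.addr()
    }

    fn main() {
        let x = 5;
        let p = &x as *const i32;
        assert!(same_address(p, p));
    }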

Regex Library

Emacs regex is similar to PCRE regex. Given that, we could use the fancy-regex crate (which implements a backtracking engine) once #84 is fixed. However there are still several differences that would need to be handled.

meta characters

Emacs regex meta characters are backwards from most regex syntaxes. For example, () represents literal parens and \(\) is a capture group. Likewise, | is literal and \| is alternation. This is easy enough to fix by pre-processing the regex.
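As a sketch, the pre-processing could be a single pass that swaps the escaped and unescaped meanings (a hypothetical helper that ignores complications like character classes):

    // Swap the escaped/unescaped meaning of (, ), and | when translating
    // an Emacs regex into conventional (PCRE-style) syntax.
    fn translate_emacs_regex(re: &str) -> String {
        let mut out = String::with_capacity(re.len());
        let mut chars = re.chars();
        while let Some(c) = chars.next() {
            match c {
                '\\' => match chars.next() {
                    // \( \) \| are special in Emacs -> unescaped elsewhere
                    Some('(') => out.push('('),
                    Some(')') => out.push(')'),
                    Some('|') => out.push('|'),
                    Some(other) => {
                        out.push('\\');
                        out.push(other);
                    }
                    None => out.push('\\'),
                },
                // bare ( ) | are literal in Emacs -> escape them
                '(' | ')' | '|' => {
                    out.push('\\');
                    out.push(c);
                }
                _ => out.push(c),
            }
        }
        out
    }

    fn main() {
        assert_eq!(translate_emacs_regex(r"\(foo\|bar\) (baz)"), r"(foo|bar) \(baz\)");
    }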

syntax aware matches

Several of the regex patterns match on the syntax definition of characters.

  • \w: word character
  • \s: match syntax class

"Word" and "symbol" are defined by the major modes syntax table. You could transform these into general character classes ([...]) for the rust regex engine.

There is also the special character \=, which matches at point. To handle this you could split the buffer into two parts, before point and after point, then match each half separately.

boundaries

Emacs defines a regex for the boundary of words and symbols.

  • \<: beginning of word
  • \>: end of word
  • \_<: beginning of symbol
  • \_>: end of symbol

These will need to be implemented with look-arounds. You can't simply build them into the regex engine because they can change per major mode.

Buffer Gap

Most performance-oriented regex libraries expect to operate on contiguous data. However a gap buffer will have a gap of garbage data somewhere in the buffer. This becomes a problem when the span of the regex search crosses the gap. The simplest solution here is to move the gap outside of the range of the search, though this could cause performance issues if the lines are really long. We also have to consider how to match multiline regex; I'm not sure of a good way to handle that. Here are some notes from the remacs project.

How to represent type errors when `_or_` is involved?

I see there is a TypeError defined in core/error.rs. However, the Emacs C code often has something like:

return wrong_type_argument (Qchar_or_string_p, obj);

Does that mean the Type enum needs to learn about "CharOrString" and all the other permutations of this?
I bumped into this while exploring what it would take to implement downcase.

Create new "shadow arena" from mutable borrow

In our wrapper code around native functions we have something that looks similar to this:

let val = native_func(objects[0], objects[1], &mut arena)?;
Ok(crate::object::IntoObject::into_obj(val, arena))

We are calling some native function and then converting the return value into an object using arena. However, we have the same problem here that we have in #2 with rebind!: val is now bound to the mutable borrow of arena, meaning we can't use arena for into_obj. We can't use the rebind! macro here because val is not yet an object. So instead we insert the following workaround:

let ptr = arena as *mut Arena;
let val = func(objects[0], objects[1], &mut arena)?;
let arena: &'ob mut Arena = unsafe {&mut *ptr};
Ok(IntoObject::into_obj(val, arena))

Here we are creating a raw pointer to the Arena, then using that pointer to later create a new "shadow arena" that is not borrowed from and can be used for into_obj. See the actual code below.

rune/fn_macros/lib.rs

Lines 45 to 50 in 7136b74

quote! {
    let ptr = arena as *mut crate::arena::Arena;
    let val = #subr(#(#arg_conversion),*)#err;
    let arena: &'ob mut crate::arena::Arena = unsafe { &mut *ptr };
    Ok(crate::object::IntoObject::into_obj(val, arena))
};

Why this is sound

The original &mut Arena is not used again in this scope, so we don't have overlapping mutable borrows. We really don't want arena to be mutably borrowed here, but we don't have a choice due to a limitation in the type system. The new "shadow" arena has the same lifetime constraints, but is free to be used. We know the object is still valid because we don't call garbage_collect here.

Garbage collector

The plan is for the garbage collector to be a generational, copying collector. The old generation will use an Immix-style mark-region collector, which Jon Harrop calls a breakthrough in GC design. I don't fully understand yet why it is so great, but the Rust interpreter folks used Immix for their project as well. Our current implementation already allows for copying object pointers via indirection, so we have the ability to implement this.

When to promote objects to the Major heap?

A typical generational GC will promote objects after they survive N minor heap collections, where N is usually 1. However we can take advantage of the execution patterns of Emacs. Commands in Emacs are usually run in short bursts with lots of waiting. Every time you type a character or run a command, Emacs will evaluate some code. The best time to run GC is after the command completes and the display has been updated. This is also the best time to promote objects, because anything still live at that point will typically live for a long time. If the current command generates so much garbage that it fills the minor (nursery) heap, then we could use Cheney's semi-space copying approach to remove dead objects without promotion.

Multi-threading

There is some discussion of my thoughts on multi-threaded Emacs here. Read that first. This project gives us a unique opportunity to make the interpreter thread safe. I am generally of two minds about how to handle multi-threading.

idea 1 - CSP all the way

CSP or "communicating sequential processes" is the most flexible way to implement parallelism. This is the approach used by languages like Go and Clojure. You essentially create first-class channels that can be passed around to other threads, and then only communicate through those channels. This lets you build a topology of "communicating processes" that can sequence with each other as desired. It is very simple, flexible, and powerful.

However, you run into the issue that this method could not be made backwards compatible with the existing GNU Emacs runtime. That runtime has no ability to suspend and resume threads of execution in the way that would be needed to mimic CSP. You could in theory do this with the existing Emacs threading API, but it is very unstable and will crash your Emacs, so it is not ready to be used. However, if the threading library was ever made robust, then you could in theory model CSP in GNU Emacs.

Idea 2 - coroutine style

GNU Emacs supports stackless coroutines (called generators), which allow for basic concurrency. If you limited your multi-threading API, you could have it map to coroutines as a polyfill. This would mean you would not have to write code twice, once for multi-threading, and once for not. I imagine it looking something like this.

(let ((num 0)
      (go (goroutine (lambda (x) (yield (1+ x))))))
  (setq num (resume go num))
  (message "%d" num) ;; prints 1
  (setq num (resume go (+ num 2)))
  (message "%d" num)) ;; prints 4

This would limit threads to only communicating with the thread that spawned them, and to only yielding values in the top-level function (since Emacs coroutines are stackless). But you would not need an explicit channel type, and it would be easy to create polyfill versions of goroutine, yield, and resume so that the same code would work in GNU Emacs and a multi-threaded Emacs. I don't like this solution as much as just using CSP. But maybe the backwards compatibility would be worth it.

why not use async/await for the backend?

I don’t think using Tokio or any other async await platform would be worthwhile. Fundamentally, they don’t offer any feature that you wouldn’t already get with threads. Plus async brings its own problems.

First is that you have to introduce function coloring into the codebase. Everything needs to have two versions. Plus async code is messier and harder to reason about. We are not writing a web server, so we don't need that complexity.

The only real advantage of async is lower resource usage. Green threads take significantly less space than system threads. But this is only really a concern when you want thousands of threads or are memory constrained.

The other advantage is that they can switch context a little faster because they pin threads to cores. However I see this as a disadvantage. In Tokio you need to tell the runtime upfront whether something is going to be compute heavy or IO heavy. But since the threads are running elisp, we can't know that ahead of time. Many modern CPUs have both performance cores and efficiency cores. The kernel will try to move compute-heavy tasks to the p-cores and IO-heavy tasks to the e-cores. But Tokio can't benefit from that because the threads are pinned!

Also async only supports cooperative multitasking, which means a task that doesn’t play by the rules can starve others. But OS threads are preemptive, so that can’t happen.

What should be shared and what should be thread-local?

Things that change only rarely, such as functions, can be shared between threads. However most other things, like variable bindings and properties, will be thread-local. To better support integration with existing elisp, thread-local types will be copied to a thread as needed. This ensures that users don't have to worry about "is this variable defined in this thread or not", but still protects us from data races and other nasty concurrency bugs.

how to handle variable bindings?

As described in the previous section, the current idea is to have variable bindings and symbol properties copied to other threads "on demand". Essentially, when a thread first tries to access a variable that it does not have defined locally, it will send a message to its parent thread and ask for the current value via a channel. The parent thread will send a message back with a copy of the value, or a message indicating that the variable is undefined. The child thread will wait until it receives a message back before continuing execution.
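A minimal sketch of that handshake using std channels (the message and Value types here are hypothetical stand-ins for the interpreter's real types):

    use std::sync::mpsc::{channel, Sender};

    // Stand-in for a deep-copied lisp value.
    #[derive(Clone, Debug)]
    struct Value(String);

    // Message a child thread sends to its parent to request a variable.
    enum VarRequest {
        Get { name: String, reply: Sender<Option<Value>> },
    }

    // Called on first access to a variable the child doesn't have locally:
    // ask the parent, then block until the copy (or "undefined") arrives.
    fn lookup_remote(name: &str, parent: &Sender<VarRequest>) -> Option<Value> {
        let (reply_tx, reply_rx) = channel();
        parent
            .send(VarRequest::Get { name: name.to_owned(), reply: reply_tx })
            .ok()?;
        reply_rx.recv().ok()?
    }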

how to handle errors

When an error occurs in another thread, what do you do? Any channels connected to that thread would become poisoned, and would signal an error if you tried to read from them. But what about threads that don't communicate with the main thread? Maybe you wanted to delete a file in the background or something. Raising the error asynchronously in the main thread seems really disruptive, but it could also be a footgun if we just silently ignore it. Maybe it should just be printed as a message.

"unexec/pdump" - VM memory serialization and loading

Executive Summary: I propose that it would be worthwhile to have Rune dump its serialized state into a binary file that could be reloaded at a later time to cut down on load times. The usage would be: I evaluate a large amount of elisp, dump to a file, and load my VM using that dumped elisp to cut down on load time. Creating this binary file may require a special mode when creating the VM (depending on implementation), but loading the file would not require any special mode. Loading the file would be done at VM initialization and would not be expected to happen "mid run".

For a long time, part of the Emacs build process was its famous "unexec" flow, where you would load a minimal version of Emacs, evaluate a large amount of elisp, and, if I recall correctly, then dump part of your process heap into a binary that would be loaded into Emacs' BSS memory area. Eventually Emacs replaced unexec with the portable dumper (https://github.com/emacs-mirror/emacs/blob/master/src/pdumper.h), which isn't as fast, but is much more maintainable.

v8 (Google's JavaScript engine) also has somewhat similar functionality for its Isolates; this is how Deno is able to load the TypeScript interpreter so quickly. They actually load the interpreter in v8 with their hooks during build time, and dump the binary state that is loaded at run time.

The advantage is a notable speedup for applications that load a large amount of elisp. The downside is complexity, but I think with Rust's great serialization libraries and support, this could be done with moderate effort.

A step further (and more similar to v8): instead of seeding the entire VM with this file, we could seed a thread with the binary state, so that separate threads can be spun up very quickly with pre-seeded memory content and minimal overhead.

`root!` macro

This uses the same principle as the root_struct! macro. So if this is sound, I expect that will be as well.

rune/src/arena/mod.rs

Lines 78 to 83 in 7136b74

macro_rules! root {
    ($obj:ident, $arena:ident) => {
        let mut root = unsafe { $crate::arena::StackRoot::new($arena.get_root_set()) };
        let $obj = root.set($obj);
    };
}

We create a new StackRoot using an unsafe method. We then set the stack root, which will push the object onto the RootSet. This will be popped when root drops. If this stack-like behavior of the drop glue does not happen, it will lead to UB.

Why this is sound

The new method on StackRoot is unsafe to call, and requires that the root is dropped in stack order (i.e. does not move).

impl<'rt> Drop for StackRoot<'rt> {
    fn drop(&mut self) {
        self.root_set.roots.borrow_mut().pop();
    }
}

This is a similar constraint to the pin_mut! macro. We make sure of it by never exposing the binding of root. We also require that set is called before the object is used, which is part of the macro code. set has the following definition, ensuring that the object is borrowed for the lifetime of the root:

    pub(crate) fn set<'root>(&'root mut self, obj: Object<'_>) -> Object<'root> {
        ...
    }

Bootstrapping the Emacs test suite

Emacs ships with over 7000 elisp tests. Bootstrapping these tests would make a good milestone and help ensure correctness. These tests can be run with make check in GNU Emacs.

Buffer data structure

The two main options for a buffer structure are a gap buffer or a rope. They both come with different tradeoffs. I really like this comment from the author of a Rust rope crate.

I don't think the tradeoff between gap buffers and ropes is really about file size. At small file sizes anything will work. You could just use a plain contiguous string if you wanted, and it would be totally fine. The whole point of using something like a gap buffer or rope or piece table or whatever is to avoid the shortcomings of something like a contiguous string as files get larger.

Gap buffers are actually great for large files, as long as the editing patterns are favorable. Specifically, for localized edits gap buffers are O(1), which is amazing. And for single-cursor editors, localized edits are pretty much the only case that needs to be handled interactively. So gap buffers are great (even with huge files) for such editors. However, gap buffers degrade to O(N) as edits get less and less localized. So if you want your editor to support non-localized editing patterns, gap buffers probably aren't great.

Ropes make a different performance trade-off, being a sort of "jack of all trades". They're not amazing at anything, but they're always solidly good with O(log N) performance. They're not the best choice for an editor that only supports local editing patterns, since they leave a lot of performance on the table compared to gap buffers in that case (again, even for huge documents). But for an editor that encourages non-localized edits, or just wants flexibility in that regard, they're a great choice because they always have good performance, whereas gap buffers degrade poorly with unfavorable editing patterns.

In other words, regardless of file size, gap buffers have both a better best case and worse worst case compared to ropes.

It's worth noting, however, that even multiple cursors are frequently quite localized. Most of the time I use them, for example, all cursors are still in view (or nearly so). It's really only with editors that are built around multiple cursors as the way to edit things that you commonly do interactive non-local edits.

So for something like emacs, I suspect any real arguments in favor of ropes or piece tables (or whatever else) aren't going to be about editing performance, but rather are going to be about secondary features like cheaply taking "snapshots" of a document to save it asynchronously, etc.

Like the author said, gap buffers have great performance, and they are much simpler than other data structures like ropes. A gap buffer also makes things like regex search much more performant.

Given all that, I think sticking with a gap buffer is the right choice. Most complaints about gap buffers come down to misunderstanding.

edit: fixed misquote
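For the unfamiliar, here is a minimal gap-buffer sketch (illustrative only, not Rune's actual text-buffer crate) showing why local edits are O(1): the gap sits at the cursor, and only moving it costs O(distance).

    // Minimal gap buffer: bytes live on both sides of a gap of free space.
    struct GapBuffer {
        data: Vec<u8>,
        gap_start: usize,
        gap_end: usize,
    }

    impl GapBuffer {
        fn new(capacity: usize) -> Self {
            Self { data: vec![0; capacity], gap_start: 0, gap_end: capacity }
        }

        // Moving the gap costs O(distance moved).
        fn move_gap(&mut self, pos: usize) {
            while self.gap_start > pos {
                self.gap_start -= 1;
                self.gap_end -= 1;
                self.data[self.gap_end] = self.data[self.gap_start];
            }
            while self.gap_start < pos {
                self.data[self.gap_start] = self.data[self.gap_end];
                self.gap_start += 1;
                self.gap_end += 1;
            }
        }

        // Inserting at the gap is O(1).
        fn insert(&mut self, pos: usize, byte: u8) {
            self.move_gap(pos);
            assert!(self.gap_start < self.gap_end, "gap full; a real impl would grow");
            self.data[self.gap_start] = byte;
            self.gap_start += 1;
        }
    }

    fn main() {
        let mut buf = GapBuffer::new(8);
        for (i, b) in b"abc".iter().enumerate() {
            buf.insert(i, *b);
        }
        buf.insert(1, b'X'); // local edit near the cursor: O(1)
    }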

Cons `element_iter!` macro

This one is a beast.

rune/src/cons/iter.rs

Lines 166 to 189 in 7136b74

macro_rules! element_iter {
    ($ident:ident, $obj:expr, $gc:ident) => {
        let mut root_elem = None;
        let mut root_cons = None;
        $crate::make_root_owner!(owner);
        let mut gc_root_elem = unsafe { $crate::arena::RootStruct::new($gc.get_root_set()) };
        let mut gc_root_cons = unsafe { $crate::arena::RootStruct::new($gc.get_root_set()) };
        #[allow(unused_qualifications)]
        let list: $crate::object::List = $obj.try_into()?;
        if let $crate::object::List::Cons(x) = list {
            root_elem =
                unsafe { Some($crate::arena::Root::new($crate::arena::RootObj::default())) };
            root_cons = unsafe { Some($crate::arena::Root::new($crate::arena::RootCons::new(!x))) };
            gc_root_elem.set(root_elem.as_mut().unwrap());
            gc_root_cons.set(root_cons.as_mut().unwrap());
        } else {
            std::mem::forget(gc_root_elem);
            std::mem::forget(gc_root_cons);
        }
        #[allow(unused_mut)]
        let mut $ident = $crate::cons::ElemStreamIter::new(&root_elem, &root_cons, owner);
    };
}

Essentially what we need to do here is create an iterator where the returned value is rooted on each iteration and can only live for that iteration. This is because cons cells are globally mutable, and elements of the list could become unreachable at any point. So if the objects outlived their iteration cycle, they could potentially get collected, leading to use-after-free. The only way to do this without GATs is to use the streaming-iterator crate. This redefines the iterator's next method to bind the lifetime to the borrow of the iterator, like this:

    fn next(&mut self) -> Option<&Self::Item> {
        self.advance();
        (*self).get()
    }

So this means we need a way to root the item on every cycle. We declare two new roots: one for the item, one for the cons cell we are using for iteration.

let mut gc_root_elem = unsafe { $crate::arena::RootStruct::new($gc.get_root_set()) };
let mut gc_root_cons = unsafe { $crate::arena::RootStruct::new($gc.get_root_set()) };

However, we have to handle the case where the iterator is empty. In that case we can't set the root, because we don't have anything to set it with (see #3 for the invariant of rooting). So we declare some Option types that will hold our pinned location on the stack.

let mut root_elem = None;
let mut root_cons = None;

Then we can conditionally set the value of those Option types if our cons is not nil. If it is nil, then we simply call forget on the roots so that the drop glue is not called. If the drop glue were called, it would pop items off the root stack that we never pushed on.

if let $crate::object::List::Cons(x) = list {
    root_elem =
        unsafe { Some($crate::arena::Root::new($crate::arena::RootObj::default())) };
    root_cons = unsafe { Some($crate::arena::Root::new($crate::arena::RootCons::new(!x))) };
    gc_root_elem.set(root_elem.as_mut().unwrap());
    gc_root_cons.set(root_cons.as_mut().unwrap());
} else {
    std::mem::forget(gc_root_elem);
    std::mem::forget(gc_root_cons);
}

Why this is sound

We are using streaming-iterator to make sure objects can't escape an iteration cycle. During that iteration cycle, we ensure that both the item and the cons are rooted. The root locations cannot move, and they store an Option, which allows us to conditionally set them. If the cons we are going to iterate over is nil, then we make sure to forget the roots so the drop glue is not called.

Unified string type

Emacs has a unique scheme for representing strings.

src/character.h

  character code	1st byte   byte sequence
  --------------	--------   -------------
       0-7F		00..7F	   0xxxxxxx
      80-7FF		C2..DF	   110yyyyx 10xxxxxx
     800-FFFF		E0..EF	   1110yyyy 10yxxxxx 10xxxxxx
   10000-1FFFFF	F0..F7	   11110yyy 10yyxxxx 10xxxxxx 10xxxxxx
  200000-3FFF7F	F8	       11111000 1000yxxx 10xxxxxx 10xxxxxx 10xxxxxx
  3FFF80-3FFFFF	C0..C1	   1100000x 10xxxxxx (for eight-bit-char)
  400000-...		invalid

  invalid 1st byte	80..BF	   10xxxxxx
           F9..FF	   11111yyy

  In each bit pattern, 'x' and 'y' each represent a single bit of the
  character code payload, and at least one 'y' must be a 1 bit.
  In the 5-byte sequence, the 22-bit payload cannot exceed 3FFF7F.

Raw 8-bit bytes are represented by codepoints 0x3FFF80 to 0x3FFFFF. However, in the UTF-8 like encoding, where they should be represented by a 5-byte sequence starting with 0xF8, they are instead represented by a 2-byte sequence starting with 0xC0 or 0xC1. These 2-byte sequences are disallowed in UTF-8, because they would form a duplicate encoding for the 1-byte ASCII range.

Raw bytes are either plain ASCII, or, if they are over the ASCII limit of 127, they are encoded using extended Unicode codepoints. These extended code points don't follow the normal rules, and therefore occupy two bytes in the space between the one-byte and two-byte ranges. For example, if I wanted to encode 137 (#o211 #x89) as a raw byte, it would be code point 0x3FFF89. Notice that the hex value is the last byte of the code point. However, I would lay it out in memory like this (see the code sketch after the list):

  • Original binary :: 1000 1001 (0x89)
  • remove eighth bit :: 000 1001
  • encode using the table above :: 1100_0000 1000_1001
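The same layout expressed as code (a hypothetical helper for illustration, not code from Rune):

    // Encode a raw byte (0x80..=0xFF) the way Emacs' extended UTF-8 does:
    // codepoint 0x3FFF00 + byte, laid out as a 0xC0/0xC1 two-byte sequence.
    fn encode_raw_byte(byte: u8) -> [u8; 2] {
        assert!(byte >= 0x80, "bytes below 0x80 are plain ASCII");
        let payload = byte & 0x7F; // drop the eighth bit
        [0xC0 | (payload >> 6), 0x80 | (payload & 0x3F)]
    }

    fn main() {
        // 137 (#x89) becomes 0xC0 0x89, i.e. 1100_0000 1000_1001.
        assert_eq!(encode_raw_byte(0x89), [0xC0, 0x89]);
    }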

This encoding scheme is clever and flexible, but is unique to Emacs. It means that you can't reuse any string processing libraries that are expecting unicode, because Emacs supports a superset of unicode. We would like to avoid this limitation if at all possible.

Currently the plan is to have two types of string: unibyte and multibyte. Unibyte strings are raw byte arrays ([u8]) and multibyte strings are valid UTF-8 (str). There is no concept of a "raw byte" in a multibyte string; adding one will automatically convert it to unibyte.

So far the only places I have seen unibyte strings used are in the byte-compiler (for opcodes) and when viewing non-text files (like a binary). Given this, I think we can get away with changing the behavior with regard to strings and bytes. 99% of users will only ever work with valid multibyte strings. There may be more edge cases that we need to work around in the future, but I think that effort is smaller than the effort of reimplementing all text processing to handle a unique encoding. Hopefully this is the right trade-off. We will try to use byte arrays throughout the code when valid unicode is not needed.

Buffers will also need to come in two flavors: a UTF-8 one and a raw-byte version. Or we could use the same buffer type but convert all non-ASCII bytes to their equivalent codepoints. We would need to make sure to handle this specially in search, though.

Text representation

Emacs docs
github comment with explanation
Emacs uses an extended UTF-8 internally. It uses code points beyond the Unicode range and can therefore need up to 5 bytes instead of the normal 4. It also has a special "raw byte" encoding that is used for bytes 128-255.

raw byte encoding

src/character.h

character code	1st byte   byte sequence
--------------	--------   -------------
     0-7F		00..7F	   0xxxxxxx
    80-7FF		C2..DF	   110yyyyx 10xxxxxx
   800-FFFF		E0..EF	   1110yyyy 10yxxxxx 10xxxxxx
 10000-1FFFFF	F0..F7	   11110yyy 10yyxxxx 10xxxxxx 10xxxxxx
200000-3FFF7F	F8	       11111000 1000yxxx 10xxxxxx 10xxxxxx 10xxxxxx
3FFF80-3FFFFF	C0..C1	   1100000x 10xxxxxx (for eight-bit-char)
400000-...		invalid

invalid 1st byte	80..BF	   10xxxxxx
         F9..FF	   11111yyy

In each bit pattern, 'x' and 'y' each represent a single bit of the
character code payload, and at least one 'y' must be a 1 bit.
In the 5-byte sequence, the 22-bit payload cannot exceed 3FFF7F.

remacs source

Raw 8-bit bytes are represented by codepoints 0x3FFF80 to 0x3FFFFF. However, in the UTF-8 like encoding, where they should be represented by a 5-byte sequence starting with 0xF8, they are instead represented by a 2-byte sequence starting with 0xC0 or 0xC1. These 2-byte sequences are disallowed in UTF-8, because they would form a duplicate encoding for the 1-byte ASCII range.

Raw bytes are either plain ASCII, or, if they are over the ASCII limit of 127, they are encoded using extended Unicode codepoints. These extended code points don't follow the normal rules, and therefore occupy two bytes in the space between the one-byte and two-byte ranges. For example, if I wanted to encode 137 (#o211 #x89) as a raw byte, it would be code point 0x3FFF89. Notice that the hex value is the last byte of the code point. However, I would lay it out in memory like this:

  • Original binary: 1000 1001 (0x89)
  • remove eighth bit: 000 1001
  • encode using the table above: 1100_0000 1000_1001

display

One tricky thing about this layout is that the same display representation can have two meanings. For example if I see

\211

It can either be codepoint 0x89 or codepoint 0x3FFF89, the former being a normal unprintable Unicode character, the latter being a raw byte from Emacs' extended UTF-8. This can be confusing.

solution 1 - Create custom Encoding format

This is the approach that Remacs took. They basically have to reimplement all the string primitives on the new encoded format. This has the disadvantage that you can't reuse existing Rust libraries for strings. Things like regex will probably be okay, because they operate on &[u8] directly, but they will fail to match raw bytes, because those have a different representation.

Solution 2 - Use bstr and assume conventional UTF-8

We still allow inserting any byte value into the buffer, but if it is invalid we just leave it as-is. Strings will use the bstr approach of validating the UTF-8 as needed. This is useful because there is a community of crates that support "conventional UTF-8" instead of the "always UTF-8" that normal Rust strings follow. So long as a crate takes [u8] or bstr, we can use it.
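To illustrate, here is the kind of usage the bstr crate enables (a small sketch):

    use bstr::ByteSlice;

    fn main() {
        // Mostly UTF-8 text containing a two-byte sequence (0xC0 0x89) that
        // is invalid in standard UTF-8 (it happens to be Emacs' raw-byte
        // form of 0x89).
        let text: &[u8] = b"hello \xc0\x89 world";
        // chars() decodes valid UTF-8 and yields U+FFFD for invalid bytes.
        for c in text.chars() {
            print!("{c}");
        }
        println!();
        // Byte-oriented operations like substring search work unchanged.
        assert_eq!(text.find(b"world"), Some(9));
    }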

A hello from a similar project 👋🏻

Hey! I found your project some weeks ago and found it super interesting - I am doing something similar, but in Go: https://github.com/federicotdn/pimacs.

Your project is more active than mine, but I thought it would be interesting to share ideas. Go and Rust are very different languages of course, but some challenges are more or less the same regardless of the language. For example, I've seen the following topics discussed on this repo, which I've also thought about:

  • Handling of strings, multibyte vs. unibyte, using plain UTF-8 vs. Emacs' custom UTF-8-like encoding
  • Importing Elisp files from the Emacs codebase into Rune
  • Concurrency! (a fun one to tackle in Go, given goroutines + channels)
  • Implementation of an Elisp reader, bytecode interpreter and/or JIT compiler
  • Testing in general (using Emacs' ERT?)

I've written about some of my design decisions here: https://github.com/federicotdn/pimacs/blob/main/etc/design.md. As part of the project, I've written a Python script called extract.py that creates a JSON file with information about all the subroutine declarations in Emacs; perhaps that could be useful for Rune as well! (https://github.com/federicotdn/pimacs/blob/main/test/data/emacs_subroutines.json)

I'm aware that a GitHub issue is not a great medium for these types of discussions, so please feel free to close it, as it is not really an "issue" with Rune itself. Cheers!

error: failed to get `get-size` as a dependency of package `text-buffer v0.1.0 ...rune/crates/text-buffer)`

cargo build
error: failed to get get-size as a dependency of package text-buffer v0.1.0 (/home/declan/src/Rust/rune/crates/text-buffer)
... which satisfies path dependency text-buffer (locked to 0.1.0) of package rune v0.1.0 (/home/declan/src/Rust/rune)

Caused by:
failed to load source for dependency get-size

Caused by:
Unable to update /Users/troyhinckley/repos/get-size

Caused by:
failed to read /Users/troyhinckley/repos/get-size/Cargo.toml

Caused by:
No such file or directory (os error 2)

Move functionality into elisp

Hey,

this is a nice project. I had also thought before about creating an elisp runtime from scratch. One idea I had was to move as much as possible to Elisp, since Elisp is the substrate which ultimately makes Emacs hackable. The interpreter could be a metacircular interpreter written in Elisp itself (eval.c, but in Elisp). Then one needs a compiler which turns Elisp into native code, either precompiled or JIT (cranelift?). In order to bootstrap, one could use Emacs itself to produce a compiled version of the Elisp interpreter. The part in Rust would be as minimal as possible: only GC, basic primitives, and maybe some higher-level functions critical for performance. Did you also consider such an approach?

defun macro question

"The defun macro is applied to any normal Rust function and it then becomes callable from lisp."

What does "any normal Rust function" mean? If I have a function that takes a Rust struct as a parameter (or which is a method of my own Rust object), will it work?

Publish `rune` documentation online

What should be done?

Publish a github.io page with the cargo doc documentation for Rune, or alternatively find a way to publish the docs without the crate being published to crates.io. It's not currently published, and there are naming conflicts if we were to publish it as-is (I don't think this is something we should look into now).

Why this task should be completed?

I have seen other crates / languages do that as well (see the more details section), which makes the documentation easily consumable, not only directly in the code. It also simplifies checking the documentation, as one would not need to run cargo doc --workspace --no-deps to get all the crates documented.

Acceptance Criteria

  • You have searched through alternatives to publish the cargo doc documentation online.
  • If the suggested solution is the right one, publish a github.io page with the cargo doc --workspace --no-deps behaviour.
  • The github page should be built with CI, so whenever a new commit happens, we don't need to manually publish the index.html pages. Create a job for it, to run on pushes to master, no need to run it on PRs.

Where can I find more details?

Who are the POCs/stakeholders?

HashMap and moving GC

With the moving GC, we have an issue where the hash values of keys can change during garbage collection. The hash of an object is taken from its value, which is just a tagged pointer. So when the pointer moves, so does the hash. There are a couple possible solutions to this.

1. store the hash value in the object header

This is what other dynamic languages like C# and Java do. This is easy because we only need to calculate the hash once and it will never change. However this also takes more memory, at least 4 bytes if we want a good hash. This is especially problematic for Cons, which are very common and should only be 2 words wide. This would require them to have a header, which they currently don't need.

2. rehash the keys during GC

This is the path we are going to take for now, until we can think of a better solution. Every GC, the hashmap keys have to be recalculated and reinserted. This is costly. The way we do this now is via a RefCell that lets us mutate a HashMap during collection. But this means we could also have runtime panics if there exists a root to one of the hashmap values. Not ideal. Here is a paper about how to handle this issue and make it less costly.

Ideally we should define a custom HashMap that lets us rehash all the keys without moving the values. I think this might be possible, but it might also require us to fork either HashMap or IndexMap.
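For scale, the brute-force version of option 2 amounts to draining and reinserting every entry after a collection (a sketch with hypothetical types):

    use std::collections::HashMap;

    // Hypothetical handle whose hash derives from its (movable) address.
    #[derive(PartialEq, Eq, Hash, Clone, Copy)]
    struct ObjPtr(usize);

    // After a moving collection, keys hash differently, so every entry
    // must be pulled out and reinserted under its forwarded key.
    fn rehash_after_gc<V>(
        table: &mut HashMap<ObjPtr, V>,
        forward: impl Fn(ObjPtr) -> ObjPtr, // forwarding map from the GC
    ) {
        let entries: Vec<(ObjPtr, V)> = table.drain().collect();
        for (key, value) in entries {
            table.insert(forward(key), value);
        }
    }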

3. Is there a possible third option?

Root projection

This really needs to be a proc macro to auto-derive, since implementing it is unsafe, but that is not yet implemented.

We get a RootRef by using RootOwner on Root. Root projection lets us take a RootRef of a struct and get a RootRef of its fields. This is similar to pin-project, with a few big differences. The first is that a Pin<P> holds a pointer to some data, but our type RootRef<T> holds the data itself. This means we can never return an owned RootRef (like you can with Pin) but instead return only references from our projections.

For example here is what is implemented for hashmap

rune/src/arena/root.rs

Lines 357 to 384 in 7136b74

impl<K, V> RootRef<HashMap<K, V>>
where
    K: Eq + std::hash::Hash,
{
    pub(crate) fn get<Q: ?Sized>(&self, k: &Q) -> Option<&RootRef<V>>
    where
        K: std::borrow::Borrow<Q>,
        Q: std::hash::Hash + Eq,
    {
        self.inner
            .get(k)
            .map(|v| unsafe { &*(v as *const V).cast::<RootRef<V>>() })
    }

    pub(crate) fn get_mut<Q: ?Sized>(&mut self, k: &Q) -> Option<&mut RootRef<V>>
    where
        K: std::borrow::Borrow<Q>,
        Q: std::hash::Hash + Eq,
    {
        self.inner
            .get_mut(k)
            .map(|v| unsafe { &mut *(v as *mut V).cast::<RootRef<V>>() })
    }

    pub(crate) fn insert<R: IntoRoot<V>>(&mut self, k: K, v: R) {
        self.inner.insert(k, unsafe { v.into_root() });
    }
}

We get the interior value which does not have RootRef on it (similar to how we would get a &mut T from a Pin<P> with get_mut_unchecked). Then we call the function we need (like get) and cast the return value &V directly to &RootRef<V>. This lets us use root-only methods on V because we have transitively rooted it.

Why this is sound

RootRef is repr(transparent), so it is safe to cast to it as a wrapper, since we know the parent is rooted. The "child" reference borrows from the "parent", so it will follow Rust's normal borrowing rules. This is the same logic that std::cell::Cell's as_slice_of_cells uses.

Unsound transmutation of `repr(Rust)` types

Several functions in this crate transmute integers to and from Data-containing enums. However, this is unsound, since both Data and the enums containing it are repr(Rust), which has no layout guarantees. To fix this, additional repr attributes must be added:

  • In src/object/data.rs, Data must have #![repr(transparent)].
  • In src/object/mod.rs, Object must have #![repr(u8, align(8))].
  • In src/object/sub_type.rs, FuncCell must have #![repr(u8, align(8))].
  • In src/object/sub_type.rs, IntOrMarker must have #![repr(C)].
  • In src/object/sub_type.rs, Number must have #![repr(u8, align(8))].

Also, Data::from_raw() should really be marked unsafe. For a Data<&'a T> value to be valid, its data must be derived from an &'a T reference, and Data::from_raw() is unable to check this. Three new unsafe blocks should be added to the call sites: UNUSED, which is sound since a Data<()> is always valid; Data::<&'a T>::from_ref(), which is sound since ptr is derived from an &'a T reference; and Data::<i64>::from_int(), which is sound since a Data<i64> is always valid.

Finally, I have a minor question: Why is LCellOwner::new() marked unsafe? Is there any unchecked precondition or invariant it depends on?

New Object representation

As mentioned in #12, the current way of representing an Object as a Rust enum containing Data cannot comply with strict provenance, because we have to convert a type to an int to convert it to an array. We need some better way to represent this.

My initial idea is to use generics. We define a new type Gc<T>, where T is some object. If you have a Gc, you can call .get() to return the enum inside. This enum will not contain Data but will instead be 2 words wide (i.e. not tagged). This enum will be used for matching the values.

Under the hood, Gc<T> can implement the tagging however it wants. Currently we are restricted to whatever layout Rust picks for us. We will also have some base object Object that contains all the tags. Because it is a superset of all other objects, you can always convert a Gc<T> to a Gc<Object>. We would also define conversion methods between all the object types, maybe with proc macros.
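A rough sketch of what this could look like (the names and the tag layout here are hypothetical, not Rune's actual scheme):

    use std::marker::PhantomData;

    // Hypothetical sketch: Gc<T> hides the tagging scheme behind get(),
    // which untags into a plain enum that is convenient to match on.
    #[derive(Copy, Clone)]
    struct Gc<T> {
        raw: u64, // tag lives in the low bits; the layout is private to Gc
        _marker: PhantomData<T>,
    }

    enum Object {
        Int(i64),
        // ... pointer-carrying variants elided
    }

    impl Gc<Object> {
        fn get(self) -> Object {
            match self.raw & 0b111 {
                0 => Object::Int((self.raw as i64) >> 3),
                _ => unimplemented!("decode the other tags"),
            }
        }
    }

    fn main() {
        let obj = Gc::<Object> { raw: 42 << 3, _marker: PhantomData };
        if let Object::Int(n) = obj.get() {
            assert_eq!(n, 42);
        }
    }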

This would also fix the problem we currently have where we need to convert a type to an object and then cast it back just to use it in certain functions. Now we will just make those functions generic so they work with all types.

rune/src/interpreter.rs

Lines 260 to 262 in d869825

let tmp: Object = resolved.into();
root!(tmp, gc); // Root callable
let callable: Callable = tmp.try_into().unwrap();

Avoid the orphan rule for Rt Deref (splitting the core)

I think we might be able to work around this by introducing a new trait. Right now the generated Deref for Env would look like this.

impl std::ops::Deref for crate::core::gc::Rt<Env> {
    type Target = __Rooted_Env;
    fn deref(&self) -> &Self::Target {
        unsafe { &*(self as *const Self).cast::<Self::Target>() }
    }
}
impl std::ops::DerefMut for crate::core::gc::Rt<Env> {
    fn deref_mut(&mut self) -> &mut Self::Target {
        unsafe { &mut *(self as *mut Self).cast::<Self::Target>() }
    }
}

We are always just doing a transmute (via pointer cast) to get the Rooted type. If we defined another trait like this:

pub trait RootedDeref {
    type Target;
    fn rooted_deref(rt: &Rt<Self>) -> &Self::Target;
}

Then we could define a blanket Deref impl like this:

impl<T: RootedDeref> Deref for Rt<T> {
    type Target = <T as RootedDeref>::Target;
    fn deref(&self) -> &Self::Target {
        RootedDeref::rooted_deref(self)
    }
}

Then in the proc macro we could define an impl like this:

impl RootedDeref for Env {
    type Target = __Rooted_Env;
    fn rooted_deref(rt: &Rt<Self>) -> &Self::Target {
        unsafe { &*(rt as *const Rt<Self>).cast::<Self::Target>() }
    }
}

That would avoid the orphan rule because we are not using a foreign type anymore. That would let us keep Rt in the core but still #[derive(Trace)] outside of the core.

Work around GAT issue

Due to not having GATs on stable, we have to define IntoObject with a lifetime parameter as part of the trait definition. This trait takes some type Self and converts it into a GC object T with lifetime 'ob.

rune/src/object/mod.rs

Lines 129 to 131 in 7136b74

pub(crate) trait IntoObject<'ob, T> {
    fn into_obj<const C: bool>(self, block: &'ob Block<C>) -> T;
}

However, when defining a generic method to test the interpreter, we have to constrain 'ob to &'ob mut Arena.

    fn check_interpreter<'ob, T>(test_str: &str, expect: T, arena: &'ob mut Arena)
    where
        T: IntoObject<'ob, Object<'ob>>,
    {
         ....
    }

This requires that any new objects created in this function have the lifetime 'ob (per the trait definition), which means these objects would have to live as long as the Arena, which can't happen. So we work around this by creating a new Arena from a raw pointer.

rune/src/interpreter.rs

Lines 678 to 681 in 7136b74

let expect: Object = {
    let arena: &'ob mut Arena = unsafe { &mut *(arena as *mut Arena) };
    expect.into_obj(arena)
};

Why is this sound

We don't actually need the lifetime 'ob to be part of the IntoObject trait, but we can't work around it without GATs, which are still unstable. This is just a fundamental limitation of how lifetimes and generics interact. The new temporary arena we create simply does not have to unify with 'ob, but it is still safe to use since it upholds all the other invariants.

Pull lisp files directly from Emacs

Opening this issue to discuss whether we can optimize the directory structure that we have in the project. The reasoning is that the lisp files are not something that will ever be modified as part of this project; the lisp directory is just a "fleeting" one, where we add the files that are ready to be bootstrapped or similar.

I know that we have the bootstrap.el file with the tag RUNE_BOOTSTRAP and certainly that can stay, but what about having a git submodule of the Emacs core? That way, we can pull the changes that we need and even have a fork/branch with only the files working with Rune.

What do you think? Open to suggestions.

Test `core::env::test::test_init_variables` failed

test core::env::test::test_init_variables ... FAILED

failures:

---- core::env::test::test_init_variables stdout ----
thread 'core::env::test::test_init_variables' panicked at 'called `Option::unwrap()` on a `None` value', /home/declan/src/Rust/rune/target/debug/build/rune-5da9fb7d4c9d8a64/out/sym.rs:656:62
note: run with `RUST_BACKTRACE=1` environment variable to display a backtrace


failures:
    core::env::test::test_init_variables

test result: FAILED. 0 passed; 1 failed; 0 ignored; 0 measured; 71 filtered out; finished in 0.00s

rustc 1.71.1 (eb26296b5 2023-08-03)

Better allocation signatures?

It's unfortunate that allocation must go through refcell.

Perhaps we can do better by using some sort of ghost token (a la GhostCell)?

fn alloc<'b, 'a>(cx: &'a mut Context, token: &'b Token) -> Object<'b>
fn garbage_collect<'b, 'a>(cx: &'a mut Context, token: &'b mut Token) {}

This way we don't need rebind!, as the objects will borrow from the token instead.
A root will take an &Object and produce something not borrowed from the token.
When we garbage_collect, we take a &mut Token, ensuring we have rooted any pointers into the GC heap.

Hmm, I guess this doesn't actually work. I didn't really think this through.

objects in rooted structs

Normally an object has a lifetime parameter, like Object<'a>. However, when an object is rooted in a struct there is no good value to set this parameter to. Really it should be something like 'self, but that doesn't exist yet. So we store objects in a RawObj form that has no lifetime. This is similar to storing a raw pointer. The only way to safely get a normal object back out is if the RootObj is wrapped in RootRef. This ensures that the object is rooted and safe to use.

rune/src/arena/root.rs

Lines 200 to 204 in 7136b74

impl<'ob> AsRef<Object<'ob>> for RootRef<RootObj> {
    fn as_ref(&self) -> &Object<'ob> {
        unsafe { &*(self as *const Self).cast::<Object>() }
    }
}

Why this is sound

We don't have any methods on RootRef that expose the RawObj directly. We only create them from objects with lifetimes, since those are guaranteed to be valid. So long as we use root projection to get a raw object, we know the base struct it is part of is rooted and will be traced. Therefore it is safe to "dereference".

discussion about display engine and GUI model of emacs

Hi! Recently I've been trying to understand the emacs display engine and write a simple front-end, then I found this project, which looks interesting. I want to share some of my thoughts on this, and learn about your idea of the ui system of rune.

the current state of emacs

Details can be found in dispextern.h and xdisp.c (which has decent documentation in its comments). To summarize, Emacs builds a glyph_matrix for each frame, and a sub-matrix for each window. How these glyphs should be displayed is defined by text properties, including face, display, or invisible. Emacs constructs an iterator it over a buffer or string, consulting these text props, to build the desired glyph_matrix.
The major task of a display engine is /redisplay/. Emacs calls the C redisplay code from lisp, as described in xdisp.c:

At its highest level, redisplay can be divided into 3 distinct
steps, all of which are visible in `redisplay_internal':

. decide which frames need their windows to be considered for redisplay
. for each window whose display might need to be updated, compute
  a structure, called "glyph matrix", which describes how it
  should look on display
. actually update the display of windows on the glass where the
  newly obtained glyph matrix differs from the one produced by the
  previous redisplay cycle

We can use something like the fontified text prop to control redisplay, as in font-lock-mode and jit-lock-mode, or just call redisplay or sit-for.

my thoughts

The first part, building the glyph matrix, is efficient enough. The problem is that we cannot separate the UI part from Emacs, as redisplay is part of the lisp VM itself. Moreover, user input is blocked during redisplay, which sometimes leads to significant lag on C-u or C-d. As you said in the TODOs, an MVP editor model is desired, so the display engine must be redesigned, on both the C side and the elisp side. This is also mentioned in the mailing list.

My question is,

  1. How should the display API be organized from lisp? We already have a huge codebase of setting faces and display props in elisp, which is powerful and flexible. For instance, tex-mode makes use of the ascent and descent properties to display subscripts and superscripts, and prettify-symbols-mode is awesome too. How do we perform these operations in a non-blocking way? My idea is to make put-text-property and other functions directly interact with the UI layer, as a lisp subprocess. But I don't know if this is possible, or the proper way of doing so.

  2. Should Emacs adopt a server-client mode, i.e. a separate process, like NeoVim does? Neovim uses RPC to implement the client-server communication, but I don't think this is the proper way for Emacs. Here the same question appears once more: what should the UI part do? Just give the UI thread a buffer and text-prop maps, then let it do the rest? Or should we compute the glyph matrix in the backend and send it to the UI?

  3. The text buffer structure. I've learnt from your articles that a gap buffer is fast, but indexing line numbers would be a problem. If I understand correctly, that means commands like consult-lines or swiper are easier to implement on a rope or piece table? Or can I say that the latter two data structures are more feature-rich than gap buffers? How about incremental redisplay? Emacs implements incremental redisplay in a pretty complex approach; I don't know if this would be better if we had more buffer APIs.

I've just started learning about text editor design and implementation these days, so many of my thoughts could be naive or wrong. I'd be glad to hear your response!

JIT approach

Emacs already offers AOT compilation via native compilation. By doing so you can remove the overhead of the bytecode interpreter and optimize the code as a single compilation unit (which results in better code).

However, there are a few things you lose by compiling ahead-of-time:

  1. You don’t have any type information. Since elisp is dynamically typed, you have to assume that your input arguments can be any type. Sometimes you can do type inference because the built-in functions usually have type requirements, but that is limited in the elisp world.
  2. You don’t know which code paths are most important. Since the code has never been run when it is compiled, you don’t know which code paths are “hot”. So everything is compiled the same.
  3. You have limited ability to inline. Only builtin functions can be inlined, because any function in elisp can be dynamically updated.

All three of these can be solved with a little runtime information. If you are able to profile the code as it runs, you can see which types a function gets called with (usually the only types it will ever see) and which functions get called frequently. This allows for more aggressive optimizations than AOT and lets you compile only the functions that actually matter, because 95% of them are not worth the effort.
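
To make this concrete, here is a minimal sketch of the kind of call profiling a JIT relies on. Type and CallProfile here are hypothetical stand-ins, not Rune's actual object model:

    use std::collections::HashMap;

    #[derive(Clone, Copy, PartialEq, Eq, Hash, Debug)]
    enum Type { Int, Float, String }

    #[derive(Default)]
    struct CallProfile {
        calls: u64,
        seen_types: HashMap<Vec<Type>, u64>,
    }

    impl CallProfile {
        // Record the argument types observed at each call.
        fn record(&mut self, arg_types: Vec<Type>) {
            self.calls += 1;
            *self.seen_types.entry(arg_types).or_insert(0) += 1;
        }

        // After enough calls the function is "hot" and worth compiling,
        // specialized for its most common type signature.
        fn hot_signature(&self, threshold: u64) -> Option<&Vec<Type>> {
            if self.calls < threshold {
                return None;
            }
            self.seen_types.iter().max_by_key(|(_, n)| **n).map(|(sig, _)| sig)
        }
    }

The compiled code would still need a guard that falls back to the interpreter when an argument arrives with a type the profile never saw.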

tracing vs method JITs

Tracing JITs start by looking for backwards jumps (i.e. the end of loops). They then trace every instruction until the same point is hit again, essentially making a "trace" of the loop body. This trace is then JIT compiled and spliced into the program stream. Since most hot code is in loops, this tends to work very well. It also has the advantage that the trace has no branches, so it can better take advantage of code motion. You also don't need to worry about inlining, because called methods are automatically inlined into the trace.

The other camp is method JITs. Here you just look for "hot" methods, record type information about the arguments and branches, and then JIT compile the whole method.

From what I understand of real-world use cases, tracing JITs can have better peak performance, but they also tend to have more pathological cases that are harder to debug, and they are very sensitive to parameter tuning. Method JITs don't have the same maximum speedup, but they are more consistent and have fewer pitfalls. Overall I see them as simpler to comprehend and debug.

Testing against GNU Emacs

We are striving to be “bug compatible” with GNU Emacs, insofar as it makes sense to do so. We might break with some behavior if it is obscure enough or if doing so adds enough value to justify it. But the right answer to “what should this function do?” is almost always “whatever GNU Emacs does”.

Given that, we would like to have a way to test against Emacs and compare behavior. This issue is to brainstorm the best way to do that.

Currently the plan is to create a new rust binary that can feed a test file into Rune and GNU Emacs and make sure they get the same output. If one throws an error, so does the other. And the result of each expression is the same.

This can be expanded to include fuzzing/property testing. There could be some code to parse each built-in function in Rune and get its type signature. We could then test the arity, accepted types, and random values against GNU Emacs. This would help flush out edge cases and differences in behavior. It could also help us catch changes between major version upgrades of GNU Emacs.

We could also create dedicated fuzzers for specific functionality. For example, we have some code to convert a lisp regex to a Rust regex. We could send in random regexes and ensure that if Emacs considers one valid, it is also converted to a valid Rust regex. Another example is printing: ensuring that the printed representation of everything is the same.
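
As a starting point, here is a rough sketch of the harness idea. Evaluating through emacs --batch --eval is a real invocation; how Rune itself would be run is left as a comment, since its CLI is still in flux:

    use std::process::Command;

    // Evaluate an expression in GNU Emacs via --batch and capture the output.
    fn eval_in_emacs(expr: &str) -> std::io::Result<String> {
        let lisp = format!("(princ (format \"%S\" {expr}))");
        let out = Command::new("emacs")
            .arg("--batch")
            .arg("--eval")
            .arg(&lisp)
            .output()?;
        Ok(String::from_utf8_lossy(&out.stdout).into_owned())
    }

    fn main() -> std::io::Result<()> {
        let expr = "(substring \"hello\" 1 3)";
        // A real harness would also evaluate `expr` through Rune and assert
        // that both the results and the error/no-error behavior match.
        println!("emacs => {}", eval_in_emacs(expr)?);
        Ok(())
    }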

Garbage Collector does not need .unmark()

This suggestion uses a little extra memory to save a little bit of time.

Change the type of mark from a boolean to an integer, and add an integer gc_count which keeps track of how many times garbage collection has happened.

When marking objects as being accessible, assign the value of gc_count to the 'mark' of every reachable object.

Objects whose mark is not equal to gc_count can be freed.

For objects being kept, the mark can be left as is.

If you come across an object whose mark is not gc_count or gc_count - 1, panic.

If you think corruption is inconceivable, leave mark as a bool, but on alternate GCs, cycle between assigning true and false to it.
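
A minimal sketch of the suggestion, assuming a hypothetical GcHeader layout rather than Rune's actual object representation:

    struct GcHeader {
        mark: u32, // was: bool
    }

    struct Gc {
        gc_count: u32, // incremented once per collection cycle
    }

    impl Gc {
        fn mark_reachable(&self, obj: &mut GcHeader) {
            // A live object should carry the current or the previous
            // collection's count; anything else implies corruption.
            assert!(
                obj.mark == self.gc_count || obj.mark == self.gc_count.wrapping_sub(1),
                "corrupt mark value"
            );
            obj.mark = self.gc_count;
        }

        fn is_garbage(&self, obj: &GcHeader) -> bool {
            // No unmark pass is needed: a stale count identifies garbage.
            obj.mark != self.gc_count
        }
    }

The cost is one integer per object header instead of a bool, in exchange for skipping the unmark pass entirely.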

WithLifetime trait

pub(crate) trait WithLifetime<'new> {
    type Out: 'new;
    unsafe fn with_lifetime(self) -> Self::Out;
}

The WithLifetime trait is an unsafe trait that lets us be generic over changing the lifetime of some type. It is equivalent to

let x: &'old T = ...;
let y: &'new T = unsafe { &*(x as *const T) };

However, lifetimes can also be type parameters of generic types. For example, T could be a Vec<&'a i32>, but in generic code you don't have access to the lifetime parameters. So we create this trait to let us transmute the lifetime. The problem with this trait is that the output type is different from the input type, even though they are really the same type with different lifetime parameters.

For example here is the implementation for a GcManaged reference.

impl<'new, 'old, T: GcManaged + 'new> WithLifetime<'new> for &'old T {
    type Out = &'new T;
    unsafe fn with_lifetime(self) -> Self::Out {
        &*(self as *const T)
    }
}

Why this is sound

This trait is only safe to implement for types that are managed by the GC. The output type must be exactly the input type with the lifetime set to 'new. And even then we have to take care when calling it. The Out GAT is constrained to the 'new lifetime, and that lifetime is a parameter of the trait bounds. Here is an example usage:

    pub(crate) fn bind<'ob>(&self, _: &'ob Context) -> <T as WithLifetime<'ob>>::Out
    where
        T: WithLifetime<'ob> + Copy,
    {
        // SAFETY: We are holding a reference to the context
        unsafe { self.inner.with_lifetime() }
    }

Here the output of this function is <T as WithLifetime<'ob>>::Out, and the trait bounds specify the lifetime we want to transmute it to. Since T implements WithLifetime, we know it is safe to bind to the lifetime of the Context, because the object will not be dropped as long as that immutable reference exists.

Move core to workspace crate: rune-core

This is the issue tracking my work to move the core out to the workspace crate rune-core. I have the code in my fork, almost ready to PR, though I had some issues with Miri. I wanted to create the issue to get some ideas on ways to resolve the problems, and to introduce @CeleritasCelery to the different changes on my PR first to get some early feedback. Here's the commit.

It's key to note that in the process, we include different changes that guide the build process of the crates, in order to arrive at a place that is both ergonomic and easy to maintain:

We removed the defvar, defvar_bool and defsym macros. They were not saving that much work (they just create a constant and initialize it), and they were convoluting the dependency graph between the crates. It's key that the core crate can use the symbols that were previously generated in the build.rs file, but it does not need to know about the defuns.

For that, we moved the variables and symbols (not the defun symbols) to a static file env/sym.rs. That file contains all the required constants for the variables and symbols.

// Symbols without the defuns
pub static BUILTIN_SYMBOLS: [SymbolCell; 84] = [
    SymbolCell::new_const("nil"),
    SymbolCell::new_const("t"),
    SymbolCell::new(":test"),
    SymbolCell::new(":documentation"),
    SymbolCell::new("md5"),
    SymbolCell::new("sha1"),
    SymbolCell::new("sha224"),
    SymbolCell::new("sha256"),
    ...
];

// also include init_variables.

On the other hand, the defun symbols and constants are still generated with the build script, to avoid losing the convenience of defining the functions with the defun proc macro.

/// Contains all the dynamically generated symbols for `#[defun]` marked functions.
pub(super) static DEFUN_SYMBOLS: [SymbolCell; 202] = [
    SymbolCell::new("floor"),
    SymbolCell::new("float"),
    SymbolCell::new("make-keymap"),
    SymbolCell::new("make-sparse-keymap"),
    SymbolCell::new("use-global-map"),
    SymbolCell::new("set-keymap-parent"),
    SymbolCell::new("define-key"),
    SymbolCell::new("message"),
    SymbolCell::new("format"),
    SymbolCell::new("format-message"),
    ...
];

// init_defun now does not need to take a range on the array.
pub(crate) fn init_defun() {
    for (sym, func) in DEFUN_SYMBOLS.iter().zip(SUBR_DEFS.iter()) {
        unsafe { sym.set_func((*func).into()).unwrap(); }
    }
}

The issue is that Symbol::get does a pointer dereference into BUILTIN_SYMBOLS (which now doesn't include the defuns), so if we try to get a defun symbol, we hit UB. Here's the test triggering it:

#[test]
fn test_bytecode_call() {
    use OpCode::*;
    let roots = &RootSet::default();
    let cx = &mut Context::new(roots);
    defun::init_defun();

    // (lambda (x) (symbol-name x))
    make_bytecode!(
        bytecode,
        257,
        [Constant0, StackRef1, Call1, Return],
        [defun::SYMBOL_NAME],
        cx
    );
    check_bytecode!(bytecode, [defun::SYMBOL_NAME], "symbol-name", cx);
}

There we check the bytecode for defun::SYMBOL_NAME, which is no longer part of BUILTIN_SYMBOLS. What do you think? What could potentially be a solution?

Clarify to contributors whether you want this to be able to be part of Emacs

Given Emacs' already small community, it would be best if you were explicit with contributors about whether you ever want to contribute any of this code upstream.

I.e., in the contributor landing area, either say:

"I haven't assigned copyright to the FSF and do not expect contributors to, so be aware this can never be part of Emacs"

or

"I've assigned copyright (or will be) to the FSF and ask contributors to do so so that we could someday think about making some of this part of Emacs"

Neither Remacs nor Emacs-ng did the copyright assignment (which is fine), but they also failed to mention this to their contributors and potential contributors (which is not fine, as I think people deserve a simple warning).

Object niche optimization

Currently the core object type is represented as a raw pointer *const u8. A value of nil is represented by a null pointer. If we changed this to a NonNull<u8>, it would enable types like Option<Object> to be only a single word. However, we would have to change the value of nil to something else (most likely 1). I'm not sure this would be worth it, because we use nil a lot more than we use Option<Object>, but if changing nil from 0 to 1 had no real effect, it could be worthwhile.
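
A quick illustration of the layout difference, using hypothetical wrapper types rather than the real Object:

    use std::mem::size_of;
    use std::ptr::NonNull;

    struct RawObject(*const u8);     // nil == null pointer
    struct NicheObject(NonNull<u8>); // nil would have to become e.g. 1

    fn main() {
        // A plain pointer has no forbidden value, so Option needs an
        // extra discriminant word...
        assert_eq!(size_of::<Option<RawObject>>(), 2 * size_of::<usize>());
        // ...while NonNull's forbidden null value gives Option a free niche.
        assert_eq!(size_of::<Option<NicheObject>>(), size_of::<usize>());
    }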

Allocation layout

Currently all allocations are part of a big enum. This is very space inefficient, since each enum instance has to be the size of the largest variant. We need to create an object header that contains the size of the object, which will be much more space efficient.

Solution 1 - Embed the header directly in the objects

Since all the object types are gc-only, we could keep the header directly in the object struct. Something like this:

struct LispString {
    header: ObjectHeader,
    // other fields
}

However, in order to make this work, the struct would need to be #[repr(C)]. I don't like that because it means the compiler can't optimize the layout; we would need to do it by hand. However, it would probably be the easiest solution.
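
For illustration, here is a minimal sketch of solution 1, with hypothetical ObjectHeader and Type stand-ins. Because #[repr(C)] puts the header at offset 0 of every object, any object pointer can be cast to a header pointer to recover the type and size:

    #[repr(C)]
    struct ObjectHeader {
        ty: u8, // stand-in for the real Type enum
        size: usize,
    }

    #[repr(C)]
    struct LispString {
        header: ObjectHeader,
        // other fields
    }

    fn header_of(obj: *const LispString) -> *const ObjectHeader {
        // Dereferencing this later is sound only because #[repr(C)]
        // guarantees the header sits at offset 0 of every object.
        obj.cast::<ObjectHeader>()
    }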

Solution 2 - Store the header separately

Instead of storing the header in the object struct, we could store it only in the heap. The pointer to an object would have provenance over both the Header and the object.

struct ObjectHeader {
    ty: Type, // `type` is a reserved word in Rust
    size: usize,
}

One problem with this approach is that if you take a reference to the object itself (references only have provenance over the thing they reference), you lose the ability to access the header. Solution 1 is probably simpler.

`rebind!` macro

The rebind macro has this definition:

rune/src/arena/mod.rs

Lines 96 to 102 in 7136b74

macro_rules! rebind {
    ($item:ident, $arena:ident) => {
        #[allow(unused_qualifications)]
        let bits: $crate::object::RawObj = $item.into();
        let $item = unsafe { $arena.rebind_raw_ptr(bits) };
    };
}

We cast the object into a raw form that removes the lifetime, then call an unsafe function to create a new object from that raw pointer.

Why this is safe

We are not actually changing the &mut reference to the Arena. Instead we are releasing it and then reborrowing it immutably. We make sure to shadow the name so the old binding is no longer available.

string type layout

There is a conflict between the Rust world and the elisp world. Rust expects aliasing to be checked explicitly at compile time, and elisp says you can alias anything you want. We need a solution that makes both of these worlds happy.

Elisp strings are mutable. This might not be such a big deal, except that since not all characters (code points) are the same size, you may need to reallocate the string when mutating it. So we need some way to mutate aliased strings in Rust.

Take the following code sample:

    let str1 = "foo".into_obj();
    let str2 = str1;
    mutate(str1.untag(), str2.untag());

    fn mutate(str1: &LispString, str2: &LispString) -> &str {
        let slice: &str = str1.get(); // take an immutable reference through str1
        str2.set_at(0, 'å'); // mutate through str2, forcing a reallocation that frees slice's buffer
        slice // return the now-invalidated slice
    }

We need to find some way to handle this situation.

1 - current solution: RefCell

The easiest way to handle this from an implementation point of view is using RefCell. This is how things are currently set up. However, this comes with some big downsides. For one, we add overhead to all string accesses, including immutable ones. Second, and probably most important, we open up the opportunity for runtime panics. Mutating a string should never be an error (unless the string is const).

2 - copy on write

Since the problem is that all references to the string get invalidated on mutation, we could just make a copy instead. Anytime you mutate a string, the old string buffer is kept valid until the next garbage collection; we just update the "current" string buffer to point to the new copy.

This has the advantage of being simple implementation-wise, but it makes mutation expensive. Probably the only reason you would use mutation from elisp is performance, and now that advantage is gone. This might be okay, because string mutation is a relatively rare operation in elisp.

3 - unsafe

There are only a few functions that actually mutate strings from elisp:

  • aset
  • store-substring
  • clear-string

Maybe it would be worth it to just mark mutation as unsafe and require the caller to ensure no aliasing happens? There would not be that many unsafe blocks to inspect. This would be fine so long as the mutating subrs are only called from elisp, but if another Rust function calls them, all bets are off.

Use Result in reader.rs

Currently the reader has a defined type for errors, but it is stored as a variant of the Token enum. This is not idiomatic Rust. A better way to handle this would be to have the reader functions return a Result<Token, Error>. This would let us take advantage of the try operator (?) and lead to more idiomatic code.
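
Here is a minimal sketch of the suggested shape; Token and ReadError are hypothetical stand-ins, not the actual types in reader.rs:

    #[derive(Debug)]
    enum Token {
        OpenParen,
        CloseParen,
        Symbol(char),
    }

    #[derive(Debug)]
    struct ReadError(String);

    // Instead of encoding errors as a Token variant, return Result...
    fn read_token(input: &str) -> Result<Token, ReadError> {
        match input.trim_start().chars().next() {
            Some('(') => Ok(Token::OpenParen),
            Some(')') => Ok(Token::CloseParen),
            Some(c) if c.is_alphabetic() => Ok(Token::Symbol(c)),
            other => Err(ReadError(format!("unexpected input: {other:?}"))),
        }
    }

    // ...so that callers can propagate failures with the try operator.
    fn expect_open(input: &str) -> Result<(), ReadError> {
        match read_token(input)? {
            Token::OpenParen => Ok(()),
            tok => Err(ReadError(format!("expected '(' but got {tok:?}"))),
        }
    }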

Arena Singleton

The soundness of the code relies heavily on the fact that only one Arena exists in a thread at a time. Otherwise you might have code that borrows from an Arena it does not own; the object could then get collected while still accessible, resulting in a use after free.

let arena1 = Arena::new();
let arena2 = Arena::new();
let foo = arena1.add("foo"); // arena1 owns foo
rebind!(foo, arena2); // foo now borrows from arena2
arena1.garbage_collect(); // foo will be freed but is still accessible

To guarantee this, we have a singleton check that makes sure two can't be created at the same time.

rune/src/arena/mod.rs

Lines 211 to 239 in 7136b74

thread_local! {
    static SINGLETON_CHECK: Cell<bool> = Cell::new(false);
}

static GLOBAL_CHECK: AtomicBool = AtomicBool::new(false);

impl Block<true> {
    pub(crate) fn new_global() -> Self {
        use std::sync::atomic::Ordering::Relaxed as Rel;
        assert!(GLOBAL_CHECK.compare_exchange(false, true, Rel, Rel).is_ok());
        Self {
            objects: RefCell::new(Vec::new()),
        }
    }
}

impl<const CONST: bool> Block<CONST> {
    pub(crate) fn new_local() -> Self {
        SINGLETON_CHECK.with(|x| {
            assert!(
                !x.get(),
                "There was already an active arena when this arena was created"
            );
            x.set(true);
        });
        Self {
            objects: RefCell::new(Vec::new()),
        }
    }
}

Why this is sound

We can only have one global Block (part of an Arena) and one thread-local one. The global version is already created in our global symbol intern struct, so any attempt to create another would panic. This global Block is not exposed to the user unless they are editing the symbol intern struct.

If we attempt to create two thread-local arenas, it will panic, which is safe. However, if these checks could ever be circumvented it would be unsound. But I think that is impossible.

How to handle closed over variables in functions

Up to this point, all values associated with a global function were marked read-only. They are shared between threads, so mutation would not be safe. However, when working on bootstrapping cl-generic, I ran into an issue where the function is actually a closure that closes over the value of the method-cache variable. When a generic function is dispatched, the value is first looked for in the cache, and if not found it is added. This breaks our assumption that function data is never mutated.

possible solutions

  1. Make functions thread-local. This would make the issue go away, but it would also mean that we have to copy every function a thread needs into that thread. That is a lot of copying that we don't normally need (the vast majority of functions can be treated as immutable). It also means that function lookup would be more expensive, because we can't use a stable address but instead need to do a hash lookup for each symbol.

  2. Make LispHashTable thread-safe. We could wrap it in an Arc and thereby make it safe to access from multiple threads (see the sketch after this list). However, this would make code harder to reason about, because a hash table could be mutated under the hood, giving us spooky "action at a distance". It would also probably make hash table access slower. And it would do nothing to help if some other, non-hashtable data type were used.

  3. Make closure values thread-local. We could treat data inside the function as immutable, but handle closed-over variables just like normal variables (thread-local and copied on read to other threads). The key for lookup would be a tuple (function-name variable-name). For the global function we would have some sort of marker value (it can't be any existing value) to indicate it needs to be looked up. For generic functions, this means that the first time we call one we have to copy the entire method-cache to the thread (even though we only care about that one method). Also, if a function is byte-compiled we don't have a way of determining which constants are closed-over values and which are "normal".
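
For reference, here is a rough sketch of what option 2 could look like, with stand-in key and value types rather than the actual LispHashTable. Wrapping the table in Arc<RwLock<..>> makes it safe to share, at the cost of locking every access:

    use std::collections::HashMap;
    use std::sync::{Arc, RwLock};

    // Hypothetical stand-in for a shared LispHashTable.
    #[derive(Clone, Default)]
    struct SharedHashTable(Arc<RwLock<HashMap<String, i64>>>);

    impl SharedHashTable {
        fn get(&self, key: &str) -> Option<i64> {
            self.0.read().unwrap().get(key).copied()
        }

        fn insert(&self, key: String, value: i64) {
            // Any thread holding a clone can mutate the shared table, which
            // is exactly the "action at a distance" concern described above.
            self.0.write().unwrap().insert(key, value);
        }
    }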
