-
Notifications
You must be signed in to change notification settings - Fork 69
feat: Add CodeItem as pydantic type, update export methods and APIs #129
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
Merge ProtectionsYour pull request matches the following merge protections and will not be merged until they are valid. 🟢 Enforce conventional commitWonderful, this rule succeeded.Make sure that we follow https://www.conventionalcommits.org/en/v1.0.0/
🟢 Require two reviewer for test updatesWonderful, this rule succeeded.When test data is updated, we require two reviewers
|
Signed-off-by: Matteo-Omenetti <[email protected]>
Signed-off-by: Matteo-Omenetti <[email protected]>
Signed-off-by: Matteo-Omenetti <[email protected]>
Signed-off-by: Matteo-Omenetti <[email protected]>
Signed-off-by: Matteo-Omenetti <[email protected]>
91a6650
to
378209e
Compare
Signed-off-by: Michele Dolfi <[email protected]>
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM
I think we should anyway consider having a stable enum for the programming languages. From the test about I see upcoming problems with Python vs python vs py, CSharp vs C# vs c#, etc |
Signed-off-by: Christoph Auer <[email protected]>
docling_core/types/doc/labels.py
Outdated
ADA = "Ada" | ||
AWK = "Awk" | ||
BASH = "Bash" | ||
C = "C" | ||
C_SHARP = "C#" | ||
C_PLUS_PLUS = "C++" | ||
CMAKE = "CMake" | ||
COBOL = "COBOL" | ||
CSS = "CSS" | ||
CEYLON = "Ceylon" | ||
CLOJURE = "Clojure" | ||
CRYSTAL = "Crystal" | ||
CUDA = "Cuda" | ||
CYTHON = "Cython" | ||
D = "D" | ||
DART = "Dart" | ||
DOCKERFILE = "Dockerfile" | ||
ELIXIR = "Elixir" | ||
ERLANG = "Erlang" | ||
FORTRAN = "FORTRAN" | ||
FORTH = "Forth" | ||
GO = "Go" | ||
HTML = "HTML" | ||
HASKELL = "Haskell" | ||
HAXE = "Haxe" | ||
JAVA = "Java" | ||
JAVASCRIPT = "JavaScript" | ||
JULIA = "Julia" | ||
KOTLIN = "Kotlin" | ||
LISP = "Lisp" | ||
LUA = "Lua" | ||
MATLAB = "Matlab" | ||
MOONSCRIPT = "MoonScript" | ||
NIM = "Nim" | ||
OCAML = "OCaml" | ||
OBJECTIVEC = "ObjectiveC" | ||
OCTAVE = "Octave" | ||
PHP = "PHP" | ||
PASCAL = "Pascal" | ||
PERL = "Perl" | ||
PROLOG = "Prolog" | ||
PYTHON = "Python" | ||
RACKET = "Racket" | ||
RUBY = "Ruby" | ||
RUST = "Rust" | ||
SML = "SML" | ||
SQL = "SQL" | ||
SCALA = "Scala" | ||
SCHEME = "Scheme" | ||
SWIFT = "Swift" | ||
TYPESCRIPT = "TypeScript" | ||
VISUALBASIC = "VisualBasic" | ||
XML = "XML" | ||
YAML = "YAML" | ||
BC = "bc" | ||
DC = "dc" | ||
UNKNOWN = "unknown" |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
These are partially sorted alphabetically, but not completely — I would best make the sorting consistent.
…f string but variable name
fa63048
to
86cbef2
Compare
…ocling-project#129) * added code item * added code item * added code item Signed-off-by: Matteo-Omenetti <[email protected]> * added code item Signed-off-by: Matteo-Omenetti <[email protected]> * added code item Signed-off-by: Matteo-Omenetti <[email protected]> * added code item Signed-off-by: Matteo-Omenetti <[email protected]> * added code item Signed-off-by: Matteo-Omenetti <[email protected]> * add constraints to allow numpy > 2.1.0 on python3.13 and others Signed-off-by: Michele Dolfi <[email protected]> * Add CodeItem to ContentItem Signed-off-by: Christoph Auer <[email protected]> * added CodeItem in ContentItem tagged union. * added enum for programming languages * removed double CodeItem in ContentItem Union * fixed type of code_language in CodeItem class * fixed sorting of programming languages, not sorted anymore by value of string but variable name --------- Signed-off-by: Matteo-Omenetti <[email protected]> Signed-off-by: Michele Dolfi <[email protected]> Signed-off-by: Christoph Auer <[email protected]> Co-authored-by: Matteo-Omenetti <[email protected]> Co-authored-by: Michele Dolfi <[email protected]> Co-authored-by: Christoph Auer <[email protected]>
No description provided.